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METHODS FOR DETECTING COMPOUNDS WHICH MODULATE THE 
ACTIVITY OF AN LPA RECEPTOR 

THCHNICAL FIELD 

5 The present invention relates to methods for identifying and characterizing 

compounds that modulate the activity of a member of the EDG family of receptors, including 
EDG-1, EDG-2, EDG-3, EDG-4, and EDG-5 (gene sequences and amino acid sequences 
represented by SEQ ID NOS: 1, 4, 5, 6 and 7, respectively), and a similar receptor called 
PSP-24 (gene sequence and amino acid sequence represented by SEQ ID NOS: 8 and 9, 

10 respectively.) 

BACKGROUND 

Cellular signal transduction is a fundamental mechanism whereby external stimuli 
that regulate diverse cellular processes are relayed to the mtenor of cells. Frequently, binding 
of a ligand to a cell-surface receptor represems the first step in a cascade of events that results 
15 in a cellular response. The ligands recognized by specific receptors include a diverse array of 

molecules such as peptides, deoxyribonucleotide triphosphates and phospholipids. 

Research into phospholipid signaling is an area of intense scientific investigation, as 
more and more bioactive lipids are being identified and their actions characterized. One 
important addition to the growmg list of lipid messengers is lysophosphatidic acid (l-acyl-2- 
20 hydroxy-5«-glycero-3-phosphate, LPA), the simplest of all glycerophospholipids. While 

LPA has long been known as a precursor of phospholipid biosynthesis in both eukaryotic and 
prokaryotic cells, only recently has LPA emerged as an intercellular signaling molecule that 
is rapidly produced and released by activated cells, notably platelets, to influence target cells 
by acting on a specific cell-surface receptor. Moolenaar (1994) Trends Cell Biol. 4:213-219. 
25 Besides being synthesized and processed to more complex phospholipids m the endoplasmic 

reticulum, LPA can be generated through the hydrolysis of pre-existing phospholipids 
following cell activation. The best documented example concerns thrombin-activated 
platelets, where LPA production is followed by its extracellular release. Eichholtz et al. 
(1993) Biochem. J. 291:677-680. Platelet LPA is formed, at least in large part, through 
30 phospholipase A, (PLA2)-mediated deacylation of newly generated phosphatidic acid (PA). 

Gerrard and Robinson (1989) Biochim. Biophys. Acta 1001:282-285. Distinct PA-specific 
PLA2 activity has been identified in platelets (Ca^^dependent) and in rat brain (Ca'^- 
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independent), but little is known about its mode of regulation. Billah et al. (1981 ) J. Biol 
Chem. 256:5399-5403; and Thompson and Clark (1995) Biochem. J. 306:305-309. 

It remains to be examined at what stage of the platelet activation response LPA is 
produced and how it is released into the extracellular environment. Given the wide variety of 
5 LPA responsive cell types, LPA production and release are unlikely to be restricted to 

platelets. Indeed, there is preliminary evidence that growth factor-stimulated fibroblasts can 
also produce LPA. Fukami and Takenawa ( 1 992) J. Biol. Chem. 267:10988-10993. 
Furthermore, LPA may be formed and released by injured cells, probably due to nonspecific 
activation of phospholipases. Many other cell systems remain to be examined for LPA 

1 0 production. 

In freshly prepared mammalian serum, LPA concentrations are estimated to be in 
the range of approximately 2-20 hM, with oleoyl- and palmitoyl-LPA being the 
predominant species. Tokumura et al. (1994) Am. J. Physiol. 267:C204-C210; and 
Eichholtz et ai. (1993) Biochem. J. 291:677-680. LPA is not detectable in platelet-poor 
1 5 plasma, whole blood, or cerebrospinal fluid. Tigy i and Miledi ( 1 992) J. Biol. Chem. 

267:21360-21367. In common with long chain fatty acids, LPA binds with high affinity to 
serum albumin at a molar ratio of about 3:1. Tigyi et al. (1991) 7. Biol. Chem. 266:20602- 
20609; Thumser et al. (1994) Biochem. J. 301 :801-806. It is notable that serum albumin 
contains several other, as yet unidentified lipids (methanol-extractable) with LPA-like 
20 biological activity. Tigyi and Miledi (1992) J. Biol. Chem. 267:21360-21367. This raises 

the interesting possibility that LPA may belong to a new family of phospholipid mediators 
showing overlapping biological activities and acting on distinct receptors; conceivably, the 
ether-linked phospholipid platelet-activating factor (PAF) and the mitogenic lipid 
sphingosine 1 -phosphate may also belong to this putative family. Zhang et al. (1991) J. 
25 Ce//B/o/. 114:155-167. 

The range of biological responses to LPA is quite diverse, ranging from induction of 
cell proliferation to stimulation of neurite retraction and even slimemold chemotaxis, and the 
body of knowledge continues to grow as more and more cellular systems are tested for LPA 
responsiveness. Jalink et al. (1993) Proc. Natl. Acad Sci. U.S.A. 90:1857-1861; Jalink et al. 
30 (1993) Cell Growth and Differ. 4:247-255; and Moolenaar (1995) Curr. Opin. Cell Biol. 

7:203-210; Dyer et al. (1992) Molec. Brain Res. 14:293-301; Dyer et al. (1992) Molec. Brain 
Res. 14:302-309; Tigyi and Miledi (1992) J. Biol. Chem. 267:21260-21361. 
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Although its precise physiological and pathological functions in vivo remain to be 
explored, LPA derived from platelets has all the hallmarks of an important mediator of 
wound healing and tissue regeneration. Thus, in addition to acting as an autocnne stimulator 
of platelet aggregation, LPA stimulates the growth of fibroblasts, vascular smooth muscle 
cells, endothelial cells, and keratinocytes. Moolenaar (1994) Trends Cell Biol. 4:213-219; 
Jalink et al. (1994) Biochim. Biophys. Acta 1 198:185-196; Van Corven et al. (1989) Cell 
59:45-54; Tigyi et al. (1994) Proc. Nad. Acad. Sci. U.S.A. 91:1908-1912; Tokumura et al. 
(1994) Am. J. Physiol. 267:C204-C210; and Piazza et al. (1995) Exp. Cell Res. 216:51-64. 
Intriguingly, it has been observed that LPA acts as an inhibitor of eukaryotic DNA 
polymerase a. Murakami-Murofushi et al. (1992) J. Biol. Chem. 267:21512-21517 . LPA 
also exhibits anti-mitogenic activity toward myeloma cells, presumably through a distinct 
receptor subtype. Tigyi et al. (1994) Proc. Nail. Acad Sci. 91 :1908-1912; Murakami- 
Murofushi et al. (1993) Cell Structure and Function 18:363-370. 

In addition to stimulating cell growth and proliferation. LPA promotes cellular tension 
1 5 and cell-surface fibronectin binding, which are important events in wound repair and 

regeneration. Zhang et al. (1994) J. Cell Biol. 127:1447-1459; Kolodney et al. (1993) J. Biol. 
Chem. 268:23850-23855; and Lapetina et al. (1981) J. Biochem. 256:5037-5040. As a 
product of the blood-clotting process, LPA is a normal constituent of serum (but not platelet- 
poor plasma), where it is present in an albumin-bound form at physiologically relevant 
concentrations. Tigyi and Miledi (1992) J. Biol. Chem. 267:21360-21367; and Eichholtz et 
al. (1993) Biochem. J. 291:677-680. 

Recently, anti-apoptotic activity has also been ascribed to LPA. PCT Application No. 
PCT/US94/13649. In this sHidy, an actively proliferating cell line was rescued from serum 
withdrawal-induced apoptosis by LPA. In another study, evidence has been presented 
25 suggesting that LPA can suppress apoptosis in vitro as well as in ischemic organs such as 

heart and liver. Wu et al. (1996) Transplantation (in press). 

Apoptosis is a normal physiologic process that leads to individual cell death. This 
process of programmed cell death is involved in a variety of normal and pathogenic 
biological events and can be induced by a number of unrelated stimuli. Changes in the 
biological regulation of apoptosis also occur during aging and are responsible for many of the 
conditions and diseases related to aging. Recent studies of apoptosis have implied that a 
common metabolic pathway leading to cell death may be initiated by a wide variety of 
signals, including hormones, serum growth factor deprivation, chemotherapeutic agents, 
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ionizing radiation, and infection by human immunodeficiency virus (HIV). Wyllie (1980) 
Nature 284:555-556; Kanter et al. (1984) Biochem, Biophys. Res. Commun, 1 18:392-399; 
Duke and Cohen (1986) Lymphokine Res. 5:289-299; Tomei et al. (1988) Biochem. Biophys. 
Res. Commun. 155:324-331; Kruman et aL (1991) 7. Ce//. Physiol 148:267-273; Ameisen 
and Capron(1991)/wwW. Today 12:102-105; and Sheppardand Ascher (1992) J. AIDS 
5: 143-147. Agents that affect the biological control of apoptosis thus have therapeutic utility 
in numerous clinical indications. 

Cellular shrinkage, chromatin condensation, cytoplasmic blebbing, increased 
membrane permeability and interchromosomal DNA cleavage characterize Apoptotic cell 
death. Gerschenson et al. (1992) FASEB J. 6:2450-2455; and Cohen and Duke (1992) Ann. 
Rev. Immunol. 10:267-293. The blebs, small, membrane-encapsulated spheres that pinch off 
of the surface of apoptotic cells, may continue to produce superoxide radicals which damage 
surrounding cell tissue and may be involved in inflammatory processes. 

While apoptosis is a normal cellular event, pathological conditions and a variety of 
injuries can also induce it. Apoptosis is involved in a wide variety of conditions, including, 
but not limited to, cardiovascular disease; cancer regression; immune disorders, including, 
but not limited to, systemic lupus erythematosus: viral diseases; anemia; neurological 
disorders; diabetes; hair loss; rejection of organ transplants; prostate hypertrophy; obesity; 
ocular disorders; stress; aging; and gastrointestinal disorders, including, but not limited to, 
diarrhea and dysentery. In the myocardium, apoptotic cell death follows ischemia and 
reperfusion. 

In Alzheimer's disease, Parkinson's disease, Huntington's chorea, epilepsy, 
amyotrophic lateral sclerosis, stroke, ischr nic heart disease, spinal cord injury and many 
viral infections, for example, abnormally high levels of cell death occur. In at least some of 
these diseases, there is evidence that the excessive cell death occurs through mechanisms 
consistent with apoptosis. Among these are I) spinal cord injury, where the severing of 
axons deprives neurons of neurotrophic factors necessary to sustain cellular viability; 2) 
stroke, where after an initial phase of necrotic cell death due to ischemia, the rupture of dead 
cells releases excitatory neurotransmitters such as glutamate and oxygen free radicals that 
stimulate apoptosis in neighboring healthy neurons; and 3) HIV infection, which induces 
apoptosis of T-lymphocytes. 

In contrast, the level of apoptosis is decreased in cancer cells, which allows the cancer 
cells to survive longer than their normal cell counterparts. As a result of the increased 
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number of surviving cancer cells, the mass of a tumor can increase even if the doubling time 
of the cancer cells does not increase. Furthermore, the high level of expression in a cancer 
cell of the bcl-2 gene, which is involved in regulating apoptosis and, in some cases, necrotic 
cell death, renders the cancer cell relatively resistant to chemotherapeutic agents and to 
5 radiation therapy. 

There is considerable evidence of plasma membrane receptors for LPA. LPA-binding 
proteins have been reported in mammalian tissues and labeled using a photoaffinity 
crosslinker derivative, Lihom et al. (1996) Am. 1 Physiol 270:C772.C778, Thomson et al. 
(1994) Mo/. Pharmacol 45:718-723; and van der Bend et al. (1992) EMBO J. 11:2495-2501. 
10 In A^. laevis oocytes, LPA elicits oscillatory CI" currents. Tigyi and Miledi (1992) J. Biol 

Chem. 267:21360-21367. This current, like other effects of LPA, is consistent with a plasma 
membrane receptor-mediated activation of G protein-linked signal transduction pathways. 

G proteins are comprised of three subunits: a guanyl-nucleotide binding a subunit; a (3 
subunit; and a y subunit. G proteins cycle between two forms, depending on whether GDP or 
1 5 GTP is bound thereto. When GDP is bound the Ga3Y protein exists as an inactive 

heterotrimer, the GapY complex. When GTP is bound the a subunit dissociates, leaving a Gpy 
complex. Importantly, when a GapY complex operatively associates with an activated G 
protein coupled receptor in a cell membrane, the rate of exchange of GTP for bound GDP is 
increased and, hence, the rate of dissociation of the bound the a subunit from the GpY 
20 complex increases. This fundamental scheme of events forms the basis for a multiplicity of 

different cell signaling phenomena. 

At least four G protein-mediated signaling pathways have been identified in the action 
of LPA. These are: 1) stimulation of phospholipase C rnd phospholipase D; 2) inhibition of 
adenylyl cyclase; 3) activation of Ras and the downstream Raf/MAP kinase pathway; and 
25 4) tyrosine phosphorylation of focal adhesion proteins in concert with remodeling of the actin 

cytoskeleton in a Rho-dependent manner. 

GTP-binding proteins fall into two broad classes of regulatory proteins; the 
heterotrimeric G-proteins, and small GTPases, The alpha subunit of heterotrimeric G- 
proteins (Ga ) and the small GTPases, as typified by the proto-oncogene Ras, share certain 
30 structural homology, and cycle between an active GTP-bound state and an inactive GDP- 

bound state. When stimulated by an appropriate signal, G-proteins and small GTPases 
become activated by the binding of GTP and physically interact with effector molecules to 
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..ansduce the signal to the cell. In the case of G-prote.ns, b.d.g of GTP to the a subunU 
causes the low molecular weight Ga to dissociate from the CPy dimer where either Ga or 
GPy can act as the signal transducer. An intrinsic GTPase activity hydrolyses GTP to GDP 
and thus attenuates the signal. Ancillary proteins collectively known as exchange factors are 
responsible for replacmg GDP for GTP and reactivating the GTP-binding protein. 
Heterotrimeric G-protein coupled receptors are a special class of receptors. It is estimated 
that G-protein coupled receptors comprise 0A% of the human genome ( including olfactory 
and visual receptors) which could place the number of different receptors in the thousands. 
The common structural feature of these receptors are seven hydrophilic membrane spanmng 
domains Based on the three dimensional model of bacterial rhodopsin, it is predicted that 
the receptors would form a barrel shaped structure with the ligand binding domains being the 
extracellular loops and/or the transmembrane domains. 

Recently, three putative receptors for LPA have been identified suggesting that 
functionally different LPA receptors may exist that dictate the particular cellular response ot 
LPA Hecht, J, H., et al. (1996) J. Cell. Biol. 135,4), 1071-1083; Macrae, A. D., et al. (1996) 
Mol Brain Res. 42, 245-254; An, S.. et al. (1997) Biochm. Biophys. Res. Com. 231, 619- 
622- Guo Z., et al, (1996) Proc. Natl. Acad. Sci. USA 93, 14367-14372; An, et al., J. Biol. 
Chem (1998). Most cell types respond to LPA making ,t difficult to characterize the 
receptor dependency of a particular response to LPA since the response cannot be solely 
attributed to a single LPA receptor. In particular, it ,s difficult to assess ligand binding 
specificity of an LPA receptor without a naive cell line because other lipid receptors may 
exist with overlapping ligand specificity. Therefor, the yeast Saccharo.yces cere.is.ae was 
used to study the human LPA receptor EDG-2 (or Vzg-1). S. cere.is^ne contain no 
endogenous LPA receptors and is therefore a potentially useful organism in which to 
functionally express LPA receptors and analyze their ligand specificity. Other mammahan 
receptors have been functionally expressed m Saccharomyces including the sommatostatm 
receptor (Price, L. A., et al. (1995) Mol. Cell. Biol. 15(1 1), 6188-6195), the A2a adenosine 
receptor (Price, L. A., et al. (1996) Mol. Pharmacol. 50(4), 829-837) and the p2-adrenergic 
receptor (King, K., et al., (1990) Science 250, 121-123). 

Figure I shows a detailed schematic of the yeast pheromone-inducible MAP 
Kinase cascade. Saccharomyces contains a single heterotrimeric G-protein that is activated 
by mating factor binding to a specific receptor. Blumer, K. J., and Thomer, J. (1990) Proc. 
Natl Acad. Sci. USA 87, 4363-4367. Upon stimulation by an occupied receptor, the a 
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s„b„nU ot.h= he,e,ommeric 0 pro>=.n (Ga, .h. GFA> B»e produc, (D.euel, C, and K„„an 
, ,,987, Cell 50, lOOMOlO; Miyaj,™, U e, al. (.987, Cel. 50, ,0. ...0,9) becomes bound 
,0 OTP and dissociates from .he p, dimer. .n yeas., i. is te Py dime, ,ha, transduces .he 
signal .0 S.e. 1 (.he MEKK e<,u,va.en. (Lange-Caner, C, A., e, al. (.993, Science 260, 315- 
,„)) and S.e7 (.he MEK =,u.va.=n. (Nerm™, A. M., and Herstow,,., 1, (.994, Free. Na.i 
Acad Sci USA 91, 3398.3402),. The act.ve GXP-bound version of Ga is i.ac.,va.ed by 
hvdroiysis of GTP .0 ODP a. wh,ch .,me, Ga can re-associale w„h Of, and a,.enua.es .he 
s;gna, (Bhndcr, D, and lenness, D. D. (.989) «»/ Ce„. Bioi 9, 3720-3726; Cole, 0, ,1990, 
Mo/ Cell Bio,. ,0(5,0-517); Dieae.,C., and Kurjan, J. (1987) ca SO, .OO.-.O.O; 
Miya,ima 1 , e. al. (1987) Cell 50, 1011-1019,. Uke .he mammalian MAP kinase, .he yeas. 
MAP kinases FusI and Kssl ac.,va>e a „anscnp„onal ac.iva.or, .he STE,2 gene prodnc. 
,El,on E. A., e. al. ,1994) Mo/. B,ol CelH, 495-510). Ac.ivaied S.el2 in .urn ac„v,.es,he 
.,anscrip.,on of several maring-mducible genes such as TOS/lElion, E. A., e. al. ,1991) Co/. 

n , RW-^fi 41-49' Peter M etal. (1993) Ce// 73, 747-760). To 
Spring Harbor Symp. Quant. Biol. 56,^1-4^, i-^^^^y^-^^' v 

study the EDG-2 receptor using the yeast pheromone response pathway system, a stram 
carrying a mutation in the FAR! gene was used. This mutation has the effect of uncoupling 
the MAP kmase cascade from cell cycle arrest allowmg the yeast to contmue growmg durmg 
MAP kmase activation (Peter, M., et al. (1993) Cell 73, 747-760; Peter, M., and HerskowUz, 
I (1994) Science 265, 1228-1231). Secondly, a mutaUonally inactivated 5572 gene was 

created to mcrease the sensitivity of the stram to G-protein activation. The SST2 gene 
encodesaGTPase activating protein(GAP)fortheGasubunit,theG../ gene product, 

(Dohlman, H. G., et al. (1996) Mol. Cell. Biol. 16(9), 5194-5209). By inactivating the SST2 
gene product, Ga remains in the GTP-bound state longer and thus increases the ste, dy-state 
concentration of the signal transducing dimer. Finally, to quantify the response, the 
bacterial lacZ gene was fused to the mating inducible FUSI promoter to create a reporter 

gene. 

The ubi<,u,.ous presence of Ae response ehci.ed by LPA in almos. every cell hnc 
,es.ed, combined wiU, ^ amphiphihc charaner of LPA .ha, makes rad.o.igand brndmg 
assays ex»emcly difT.cul., has prescnfed considerable diffcuLies in .he molecular c.omng of 
i,s receprors. In view of .he po«,ia. physrologica. signifcance of LPA receptors m .erms of 
wound healing, ceil regenerai.on and cell prol,fera.ion and apop.osis, dtere ,s a need for drug 
screening assays exhibiting mcreased specificity that facilriate the search for agonists, inverse 
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agonists, or antagorusts of LP A, as well as methods for screening analogues of LPA to 
determine their ability to activate EDG-2, for elucidating the pharmacological properties of 

these proteins. 

The present invention addresses this need. Herein are described methods of screening 
5 for agonists or antagonists of EDG-l. EDG-2, EDG-3, EDG-4, and EDG-5, as well as 

methods of counter screening for agonists or antagonists that are specific for only one of 

these EDG receptors. 

All references cited herein are incorporated by reference in their entirety. 

^1 IMMARY OF THE INVENTION 
1 0 Methods of screening for pharmaceutical agents that stimulate, as well as 

pharmaceutical agents that inhibit, EDG-l, EDG-2, EDG-3, EDG-4, and EDG-5 activity are 
provided. 

The present invention encompasses a method for identifying compounds which 
modulate the activity of any of the EDG receptors, comprising the steps of: a) contacting 
1 5 recombinant host cells, modified to contain the DN A of SEQ. ID. NO. 1 , 4. 5, 6, 7 or 8, 

which is operably linked to control sequences for expression, with at least one compound or 
signal whose ability to modulate the activity of the EDG receptor is sought to be determined, 
and b) analyzing the cells for a difference in functional response mediated by said receptor. 
More specifically, the present invention encompasses contacting said cells with at least one 
20 composition whose ability to modulate the activity of said receptor is sought to be 

determined, and monitoring said cells for a change in the level of a particular signal 
associated with activation of the EDG receptor. EDG receptors encompassed by the present 
invention include EDG-l, EDG-2, EDG-3, EDG-4, and EDG-5. An additional receptor 
encompassed by the present invention is PSP-24, a receptor of LPA discovered in mice, 
25 which can be used as a screen to evaluate the specificity of a particular ligand for any of the 

EDG family of receptors. For purposes of the present discussion, PSP-24 shall be 
encompassed by the expressions "EDG family of receptors" and "EDG related receptors, 
because it has similarities, including being an LPA receptor. 

Additionally, the present invention contemplates a method for modulating the signal 
30 transduction activity of the EDG receptor, comprising contacting said receptor with an 

effective amount of at least one compound identified by the method described above. 
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The present invention also encompasses an agonist, antagonist, inverse agonist ,or 
ailosteric modulator identified by the above methods. 

In an alternative embodiment, the present invention encompasses a method for 
detecting an agonist, antagonist, mverse agonist, or ailosteric modulator of an EDG receptor 
having activity comprising the steps of: a) exposing a compound to an EDG receptor 
coupled to a response pathway, under conditions and for a time sufficient to allow interaction 
of the compound with the EDG receptor and an associated response through the pathway, 
and b) detecting an increase or a decrease in the stimulation of the response pathway resulting 
from the interaction of the compound with the EDG receptor, relative to the absence of the 
tested compound and therefrom determining the presence of an agonist, antagonist, inverse 
agonist, or ailosteric modulator. 

In yet another embodiment, the present invention encompasses a method for detecting 
an LPA agonist, antagonist, inverse agonist, or ailosteric modulator of LP A receptor 
comprising the steps of a) exposing a compound to the EDG-2 receptor coupled to a response 
1 5 pathway, under conditions and for a time sufficient to allow interaction of the compound with 

the EDG-2 receptor and an associated response through the pathway, and b) detecting an 
increase or a decrease in the stimulation of the response pathway resulting from the 
interaction of the compound with the EDG-2 receptor, relative to the absence of the tested 
compound and therefrom determining the presence of an agonist, antagonist, inverse agonist, 

20 or ailosteric modulator. 

In yet another embodiment, the invention encompasses a method for detecting inverse 
agonists of LPA, comprising the steps of a) exposing a compound and LPA to the EDG-2 
receptor coupled to a response pathway, under conditions and for a time sufficient to allow 
interaction of LPA with the EDG-2 receptor and an associated response through the pathway, 

25 and b) detecting an increase or a decrease in the stimulation of the response pathway, relative 

to the absence of the tested compound and therefrom determining the presence of an inverse 
agonist of LPA. 

In yet another embodiment of the present invention, a method of detecting compounds that 
modulate the interaction between a ligand of an EDG related receptor and the EDG related 
receptor is encompassed, comprising: exposing a labeled ligand of an EDG related receptor to 
a cell expressing said EDG related receptor; exposing a labeled compound that is believed to 
interact with an EDG related receptor to said cell, and detecting a change in the amount of 
labeled ligand bound to said cell. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 shows a schematic of the yeast pheromone-inducible MAP Kinase cascade. 
Components of this pathway {SST2 and FARl) that were genetically inactivated by mutation 
are identified by underlines. 

Figure 2 is a graph depicting EDG-2-mediated stimulation of FUS! ::lacZ. Yeast cells 
carrying the galactose-inducible edg-2 gene were grown in SC media containing either 2% 
galactose (filled bars) or 2% glucose (hatched bars) for seven hours in the presence of 
lysophosphatidic acid (LPA) or on galactose in the absence of LPA (open bars). After seven 
hours, the cells were assayed for p-galactosidase (lacZ) activity. 

Figures 3 A and 3B are graphs depicting the stimulation by LPA of FUS l::IacZ 
activity in cells expressing EDG-2 in a time and dose dependent manner, respectively. 3 A: 
Yeast cells carrying edg-2 (■) or empty vector (♦) were grown in synthetic complete media 
(S.C.) + 2% galactose for the indicated time prior to assaying p-galactosidase activity. 3B, 
Yeast cells carrying edg-2 were grown for seven hours at the indicated dose of LPA. 

Figure 4 is a graph depicting the specific activation by LPA, but not other related 
lysophospholipids or Sphingosine-1 -phosphate (SPP), of FUSl::lacZ, Yeast cells carr>^ing 
edg-2 were grown in S.C. + 2% galactose in the presence of LPA (■), LPC (♦), LPE (•), 
LPG (A), LPS (□) or Sph-l-P (O) at the indicated concentrations for seven hours. All lyso- 
glycerophospholipid were resuspended in BBS/EDTA + Img/ml fatty acid free bovine serum 
albumin to enhance solubility. 

Figure 5 is a graph depicting the specific activation of FUSlwlacZhy LPA, but not 
Diacyl-glycerophospholipids. Yeast cells were cultured in S.C. + 2% galactose in the 
presence of PA (■), PC (♦), PE (•), PG (A) PS (□) or LPA (O) at the indicated 
concentration for seven hours. All diacyl-glycerophospholipid were resuspended in 
BBS/EDTA + Img/ml fatty acid free bovine serum albi.imin to enhance solubility. 

Figure 6 is a graph depicting the effect of the fatty acid side-chain of LPA on 
activation of FUSl::lacZ. Yeast cells expressing EDG-2 were cultured in S.C. + 2% 
galactose in the presence of 18:1 oleoyl LPA (■), 18:0 steroyl LPA (♦), 16:0 palmitoyl LPA 
(•) or 14:0 (A) at the indicated concentration for seven hours. The numerical representation 
refers to the chain length and degree of saturation. All forms of LPA were resuspended in 
BBS/EDTA + Img/ml fatty acid free bovine serum albumin to enhance solubility. 
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Figure 7 is a graph depicting the activation of FUSl::lacZ by LP A presented either as 
a liposomal formulation or as freely soluble LPA. Yeast cells carrying EDG-2 were culture 
m S.C. + 2% galactose in the presence of freely soluble LPA (■), LPA + PC liposomes (♦), 
PC alone ( O), LPA + PG liposomes (A) or PG alone (□). Note that the concentration of 
5 lipid reflects only the LPA component of the liposome. 

MODES FOR CARRYING OUT THE INVENTION 
The present invention encompasses methods for detecting substances that modulate 
EDG receptor activity. Additionally, the present invention encompasses compositions 
comprising a host organism that does not contain an endogenous LPA receptor and that has 
1 0 been transfected with an EDG receptor. EDG receptors encompassed by the present 

invention include EDG-U EDG-2, EDG-3, EDG-4, and EDGo. An additional receptor 
encompassed by the present invention is PSP-24, which can be used as a screen to evaluate 
the specificity of a particular ligand for any of the EDG family of receptors. 

The edg-2 gene product, a lysophosphatidic acid (LPA) receptor, also reported as vzg- 
15 1 , couples to the yeast heterotrimeric G-protein and activates a MAP kinase cascade- 

dependent reporter. The response to LPA can be quantitated by using a reporter gene, 
including, without limitation, the lacZ gene or the luc gene fused to the FUSl promoter, a 
mating pheromone-inducible gene promoter, the HISS gene, or any other gene that can 
genetically comphment an auxotropic growth mutation. The yeast strain used is able to grow 
20 in the presence of activated G-protein due to a mutation in tlie FARl gene. This mutation has 

the phenotypic effect of uncoupling G-protein/map kinase activation from cell cycle arrest. 

The following definitions are for the purpose of clarifying the terms used herein, and 
are not meant to be limiting. 

Vectors usefiil for practicing the present invention include plasmids, viruses 
25 (including phage), and integratable DNA fragments (i.e., fragments integratable into the host 

genome by homologous recombination). The vector may replicate and function 
independently of the host genome, as in the case of a plasmid, or may integrate into the 
genome itself, as in the case of an integratable DNA fragment. Suitable vectors will contain 
replicon and control sequences that are derived from species compatible with the intended 
30 expression host. For example, a promoter operable in a host cell is one which binds the RNA 

polymerase of that cell, and a ribosomal binding site operable in a host cell is one which 
binds the endogenous ribosomes of that cell. 
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DN A regions are "operably" associated when they are ftinctionally related to each 
other For example; a promoter is operably linked to a coding sequence if it controls the 
transcription of the sequence; a ribosome binding site is operably linked to a coding sequence 
if it is positioned so as to permit translation. 

Heterologous DN A sequences are expressed in a host by means of an expression 
vector An expression vector is a replicable DNA construct in which a DNA sequence 
encoding the heterologous DNA sequence is operably linked to suitable control sequences 
capable of effecting the expression of a protein or protem subunit coded for by the 
heterologous DNA sequence in the intended host. Generally, control sequences include a 
transcriptional promoter, an optional operator sequence to control transcription, a sequence 
encoding suitable mRNA nbosomal binding sites, and (optionally) sequences which control 
the termination of transcription and translation. 

Transformed host cells of the present invention are cells which have been transformed 
or transfected with the vectors constructed using recombinant DNA techniques and express 
the protein or protein subunit coded for by the heterologous DNA sequences, 

A variety of yeast cultures, and suitable expression vectors for transforming yeast 
cells are known. See, e.g., U.S. Pat. No. 4,745,057; U.S. Pat. No. 4,797,359; U.S. Pat. No. 
4 615 974; U.S. Pat. No. 4,880,734; U.S. Pat. No. 4,71 1 ,844; and U.S. Pat. No. 4,865,989. 
Saccharomyces cere.isiae is the most commonly used among the yeast, although a number of 
other strains are commonly available. See, e.g., U.S. Pat. No. 4,806,472 iKluveromyces lacns 
and expression vectors therefor); U.S. Pat. No. 4,855,231 {P^chia pastons and expression 
vectors therefor). Also, any species Candida can be used. Yeast vectors may contain an 
origin of replication from the 2 micron yeast plasmid or an autonomously replicating 
sequence (ARS), a promoter, DNA encoding the heterologous DNA sequences, sequences tor 
polyadenylation and traiiscription termination, and a selection gene. An exemplary plasmid is 
YRp7, (Stmchcomb et al.. Nature 282, 39 (1979); Kingsman et al., Gene 7, 141 (1979); 
Tschemperetal.,Gene 10, 157 (1980)). This plasmid contains the rrp7 gene, which provides 
a selection marker for a mutant strain of yeast lacking the abihty to grow in tryptophan, for 
example ATCC No. 44076 or PEP4-1 (Jones, Genetics 85, 12 (1977)). The presence of the 
trpl lesion ,n the yeast host cell genome then provides an effective environment for detecting 
transformation by growth in the absence of tryptophan. 

Suitable promoting sequences in yeast vectors include, but are not limited to. the 
promoters for metallothionein. 3-phosphoglycerate kinase (Hitzeman et al., J. Biol. Chem. 



12 



wo 99/19513 



PCT/US98/71315 



10 



15 



20 



25 



30 



255 2073 (1980) or other glycolytic enzymes (Hess et al., J. Adv. Enzyme Reg. 7, 149 
(1968); and Holland et al.. Biochemistry 17, 4900 (1978)), such as enolase, glyceraldehyde- 
3-phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokmase. 
glucose-6-phosphate tsomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate 
isomerase, phosphoglucose isomerase, and glucokmase. Suitable vectors and promoters for 
use in yeast expression are further described in R. Hitzeman et al., EPO Publn. No. 73,657. 
Other promoters, which have the additional advantage of transcnption controlled by growth 
conditions, are the promoter regions for alcohol dehydrogenase 2, isocytochrome C, acid 
phosphatase, degradative enzymes associated with nitrogen metabohsm, and the 
aforementioned metallothionem and glyceraldehyde-3-phosphate dehydrogenase, as well as 
enzymes responsible for maltose and galactose utilization. 

In constructing suitable expression plasmids, the termination sequences associated 
with these genes may also be ligated into the expression vector 3' of the heterologous codmg 
sequences to provide polyadenylation and termination of the mRN A. 

A "reporter gene" is a gene that is operably linked to control sequences for expression 
of a gene of interest, and that encodes a gene product that can be detected for the purpose of 
evaluating expression of the associated gene of interest. 

The "active state" of a receptor is the state at which the ligand that stimulates the 
receptor can bind to activate a signaling pathway controlled by the receptor. 

An "EDG receptor agonist" is defined herein as a composition that is capable of 
combining with the active state of an EDG receptor to up-regulate a signaling pathway 
controlled by an the receptor. An "EDG antagonist" is defined herein as a composition that is 
...pable of combimng with the EDG receptor in either the active or inactive state, thereby 
impeding the biological action of the receptor. An "inverse agonist" of EDG is defined 
herein as a composition that is capable of combining with the inactive state of an EDG 
receptor to down-regulate a signaling pathway controlled by the receptor. 

A composition that "modulates" the activity of an EDG receptor is defined herein as a 
composition that is capable of evoking a change m the functional response mediated by said 
receptor. 

AS used hOTin. .he terms "puiified" o. "isolated" are intended to refer to a .molecule 
used in .he present tnven.ion ,n an enriched or pur. fonn obtainable from a natural source or 
bv means of genaic engmeertng or syn.he.ic chemistry. The purif ed protein, DNA or RNA 
of the invention ma, be useful in ways that protem, DNA ar,d RNA as they m.urall, 
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occur are not, such as identification of compounds selectively modulating the expression or 
the activity of the EDG-2 of die invention. 

The isolated polypeptide and polypeptide fragments of an EDG means the EDO 
which is free of one or more components of its natural environment. Purified EDG includes 
purified EDG in recombinant cell culture. The enriched form of the receptor refers to a 
preparation containing said receptor in a concentration higher than natural, or in a cell where 
it is not found under native conditions e.g., a cellular membrane fraction comprising said 
receptor. If the receptor is in a pure form it is substantially free from other macromolecules, 
particulariy from naturally occurring proteinaceous contamination. If desired, the receptor 
may be solubilized. Preferably, the receptor of the invention is in an active state meaning that 
it has both ligand binding and signal transduction activity. Receptor activity is measured 
according to methods known in the art, e.g., using a binding assay or a functional assay, e.g., 
an assay as described below. 

The invention is further intended to encompass variants of the receptor of the 
invention. For example, a variant of an EDG receptor of the invention is a functional 
equivalent of said receptor. A functional equivalent is a protein displaying a physiological 
profile essentially identical to the profile characteristic of the particular member of the EDG 
family of receptors having the amino acid sequence set forth in SEQ ID N0S:1, 4, 5, 6, 7 or 
8. The physiological profile in vitro and in vivo includes receptor effector function, 
electrophysiological and pharmacological properties, e.g., selective interaction with agonists 
or antagonists. Exemplary functional equivalents may be amino acid mutants including those 
having amino acid deletions, substitutions or insertions, and glycosylation variants. 
Functional equi valents may also include EDG receptors from a different organism. The 
present invention also encompasses methods for comparing the agonist profile of other EDG 
related receptors suchasEDG-1 (Lee, M.-J., et al. (1996) J. Biol. Chem. 271(19), 1 1272- 
1 1279; Hla, T., and Maciag, T. (1990) J. Biol Chem. 265(16), 9308-9313), EDG-3 
(Yamaguchi, F., et al. (1996) Biochem. Biophys. Res. Comm. Ill, 608-614) and H218 
(Okazaki, H., et al. (1993) Biochem. Biophys. Res. Com. 190, 1 104-1 109; MacLennan, A. J., 
et al. (1994) Moi. Cell. Neurosci. 5, 201-209) as well as the Xenopus high-affinity LP A 
receptor, PSP-24 (Guo, Z., et al. (1996) Proc. Natl Acad. ScL USA 93, 14367-14372). 

Covalent derivatives include, for example, aliphatic esters or amides of a receptor 
carboxyl group, 0-acyl derivatives of hydroxy I group containing residues and N-acyl 
derivative of amino group containing residues. Such derivatives can be prepared by linkage 
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of functionalities to reactable groups which are found in the side chains and at the N- and C- 
terminus of the receptor protein. Polypeptides of this invention may be modified post- 
translationally (e.g., acetylation or phosphorylation). 

The invention also encompasses methods wherein an EDG receptor is conjugated to a 
5 label capable of producing a detectable signal or other functional moieties. Suitable labels 

include, but are not limited to, radionuclides, enzymes, substrates, cofactors, inhibitors, 
fluorescent dyes, chemiluminescent dyes, bioluminescent compounds and magnetic particles. 

A protein for use in the invention is obtainable from a natural source, e.g., by 
isolation! by chemical synthesis or by recombinant techniques. 
1 0 The invention further provides a method for expressing the EDG receptors in host 

cells. Suitable host cells producing the receptor of the invention are multiplied in vitro or in 
vivo. Preferably, the host cells are transformed (transfected) with a hybrid vector comprising 
an expression cassette comprising a promoter and an edg DNA sequence. Subsequent to 
expression of the edg gene under control of the promoter, the specific EDG protein may be 
1 5 recovered. Recovery comprises, e.g., isolating the host cells comprising the receptor, e.g., 

from the culture broth. 

The term ''functionar or ^'biologically active", when used herein as a modifier of an 
EDG, refers to a polypeptide that is able to produce one or more of the functional 
characteristics exhibited by that specific native EDG. In one embodiment, functional means 
20 capable of binding its specific EDG ligand. In another embodiment, functional means that a 

signal is transduced as a consequence of binding of a ligand. 

Suitable host cells include eukaryotic cells, e.g., animal cells, plant cells and fungi, 
and prokaryotic cells, such r > Gram-positive and Gram-negative bacteria, e.g., Escheria coli. 
Isolated polynucleotides (or nucleic acids) encoding a polypeptide substantially 
25 identical to an EDG protein or portions thereof are designated edg. The term polynucleotide 

as used herein, may be DNA or RNA, either coding or noncoding strands, edg cDNA, 
genomic DNA and synthetic or semi-synthetic DNAs and RNAs. 

The invention includes modifications to edg DNA such as deletions, substitutions and 
additions particularly in the non-coding regions of genomic DNA. Such changes are useful 
30 to facilitate cloning and modify gene expression in methods of the present invention. 

Various substitutions can be made within the coding region that either do not alter the 
amino acid residues encoded or result in conservatively substituted amino acid residues. 
Nucleotide substitutions that do not alter the amino acid residues encoded are useful for 
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optimizing gene expression in different systems. Suitable substitutions are knovvn to those of 
skill in the art and are made, for instance, to reflect preferred codon usage in the particular 
expression systems. 

The invention encompasses methods using functionally equivalent variants and 
5 derivatives of a particular edg which may enhance, decrease or not significantly affect the 

properties of the resultant EDG. For instance, changes in the DNA sequence that do not 
change the encoded amino acid sequence, as well as those that result in conservative 
substitutions of amino acid residues, one or a few amino acid deletions or additions, and 
substitution of amino acid residues by amino acid analogs are those which will not 
1 0 significantly affect its properties, such as its ability to bind to LPA or analogs thereof 

An EDG is preferably incorporated into a vector (a virus, phage, or plasmid) which 
can be introduced by transfection or infection into a cell. The vector preferably includes one 
or more expression control sequences, in which case the cell transfected by the vector is 
capable of expressing the polypeptide. By "isolated DNA" is meant a single- or double- 
1 5 stranded DNA that is free of the genes that, in the naturally occurring genome of the animal 

from which the isolated DNA is derived, flank the edg gene. The term therefore includes, for 
example, either or both strands of an edg cDNA or an allelic variant thereof; a recombinant 
DNA which is incorporated into a vector, into an autonomously replicating plasmid or virus, 
or into the genomic DNA of a prokaryotic or eukaryotic cell; or a genomic DNA fragment 
20 {e.g. , produced by PGR or restriction endonuclease treatment of human or other genomic 

DNA). The term also includes a recombinant DNA that is part of a hybrid gene encoding 
additional polypeptide sequence. 

Stringent conditions for both DNA/DNA and DNA/RNA hybridization assays are as 
described by Sambrook et al.. Molecular Cloning, A Laboratory Manual, 2nd Ed., Cold 
25 Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1989, herein incorporated by 

reference. For example, see page 7.52 of Sambrook et al. 

Given the guidance of the present invention, the nucleic acids used in the invention 
are obtainable according to the methods well known in the art. 

For example, a DNA used in the invention is obtainable by chemical synthesis, by 
30 recombinant DNA technology or by PGR. Preparation by recombinant DNA technology may 

involve screening a suitable cDNA or genomic Ubrary. A suitable method for preparing a 
DNA or of the invention may, e.g., comprise the synthesis of a number of oligonucleotides, 
their amplification by PGR methods, and their splicing to give the desired DNA sequence. 
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Suitable libraries are commercially available or can be prepared from individual tissues or 
cell lines. 

For an individual receptor related to an EDG, the expression pattern in different 
tissues may vary. Thus, in order to isolate cDNA encoding a particular EDG-related 
5 receptor, it is advantageous to screen libraries prepared from different suitable tissues or 

cells. As a screening probe, there may be employed a DNA or RNA comprising substantially 
the entire coding region of the edg or a suitable oligonucleotide probe based on said DNA. A 
suitable oligonucleotide probe (for screening involving hybridization) is a single stranded 
DNA or RNA that has a sequence of nucleotides that includes at least 14 contiguous bases 
1 0 that are the same as (or complementary to) any 1 4 or more contiguous bases set forth in SEQ 

ID NO: 1, 4, 5, 6, 7 or 8, and preferably at least 1 8 contiguous bases that are the same as any 
1 8 or more contiguous bases set forth in SEQ ID NO: 1 , 4, 5, 6, 7 or 8. The probe may be 
labeled with a suitable chemical moiety for ready detection. The nucleic acid sequences 
selected as probes should be of sufficient length and be sufficiently unambiguous so that false 
1 5 positive results are minimized. 

Preferred regions from which to construct probes include 5' and/or 3' coding 
sequences, sequences predicted to encode ligand binding sites, and the like. For example, 
either the full-length cDNA clone disclosed herein or fragments thereof can be used as 
probes. Preferably, nucleic acid probes of the invention are labeled with suitable label means 
20 for ready detection upon hybridization. For example, a suitable label means is a radiolabel. 

The preferred method of labeling a DNA fragment is by incorporating ^^P-labeled a-dATP 
with the Klenow fragment of DNA polymerase in a random priming reaction, as is well 
known in the art. Oligonucleotides are usually cnd-label^d with ^^P-labeled y-ATP and 
polynucleotide kinase. However, other methods (e.g., non-radioactive) may also be used to 
25 label the fragment or oligonucleotide, including, e.g., enzyme labeling and biotinylation. 

After screening the library, e.g., with a portion of DNA including substantially the 
entire edg gene or a suitable oligonucleotide based on a portion of said DNA, positive clones 
are identified by detecting a hybridization signal; the identified clones are characterized by 
restriction enzyme mapping and/or DNA sequence analysis, and then examined, e.g., by 
30 comparison with the sequences set forth herein, to ascertain whether they include a full length 

edg gene (i.e., if they include translation initiation and termination codons). If the selected 
clones are incomplete, they may be used to rescreen the same or a different library to obtain 
overlapping clones. If the library is genomic, then the overlapping clones may include exons 
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and introns. If the library is a cDNA library, then the overlapping clones will include an open 
reading frame. In both instances, complete clones can be identified by comparison with the 
DNAs and deduced amino acid sequences provided herein. 

It is envisaged that the nucleic acid of the invention can be readily modified by 
5 nucleotide substitution, nucleotide deletion, nucleotide insertion or inversion of a nucleotide 

stretch, and any combination thereof Such modified sequences can be used to produce a 
mutant EDG that differs from the receptors found in nature. Mutagenesis may be 
predetermined (site-specific) or random. A mutation that is not a silent mutation should not 
place sequences out of reading frames and preferably will not create complementary regions 
1 0 that could hybridize to produce secondary mRN A structures such as loops or hairpins. 

The edg cDNA or genomic DNA can be incorporated into vectors for transfection of a 
host cell. Furthermore, the invention concerns a recombinant DNA which is a hybrid vector 
comprising at least one of the above mentioned genes. 

The hybrid vectors of the invention comprise an origin of replication or an 
1 5 autonomously replicating sequence, one or more dominant marker sequences and, optionally, 

expression control sequences, signal sequences and additional restriction endonuclease sites. 

Preferably, the hybrid vector of the invention comprises an above described nucleic 
acid insert operably linked to an expression control sequence, in particular those described 
hereinafter. 

20 Vectors typically perform two functions in collaboration with compatible host cells. 

One function is to facilitate the cloning of the edg gene, i.e., to produce useable quantities of 
the nucleic acid (cloning vectors). The other function is to provide for replication and 
expression of the gene constructs in a suitable host, either by maintenrnce as an 
extrachromosomal element or by integration into the host chromosome (expression vectors). 

25 A cloning vector comprises the DNAs as described above, an origin of replication or an 

autonomously replicating sequence, selectable marker sequences, and optionally, signal 
sequences and additional restriction sites. An expression vector additionally comprises 
expression control sequences essential for the transcription and translation of the edg gene. 
Thus, an expression vector refers to a recombinant DNA construct, such as a plasmid, a 

30 phage, recombinant virus or other vector that, upon introduction into a suitable host cell, 

results in expression of the cloned DNA. Suitable expression vectors are well known in the 
art and include those that are replicable in eukaryotic and/or prokaryotic cells. 



18 



wo 99/19513 



PCT/US98/21315 



Most expression vectors are capable of replication in at least one class of organisms 
but can be transfected into another organism for expression. For example, a vector is cloned 
in £ coli and then the same vector is transfected into yeast or mammalian cells even though it 
is not capable of replicating independently of the host cell chromosome. DNA may also be 
5 amplified by insertion into the host genome. However, the recovery of the genomic edg gene 

is more complex than that of exogenously replicated vector because restriction enzyme 
digestion is required to excise the gene. DNA can be amplified by PCR and directly 
transfected into the host cells without any replication component. 

Advantageously, expression and cloning vectors contain a selection gene also referred 
10 to as selectable marker. This gene encodes a protein necessary for the survival or growth of 

transformed host cells grown in a selective culture medium. Host cells not transformed with 
the vector containing the selection gene will not survive in the culture medium. Typical 
selection genes encode proteins that confer resistance to antibiotics and other toxins, e.g., 
ampicillin, neomycin, methotrexate or tetracycline, complement auxotrophic deficiencies, or 
1 5 supply critical nutrients not available from complex media. 

Suitable selectable markers for mammalian cells are those that enable the 
identification of cells competent to take up the edg gene, such as dihydrofolate reductase 
(DHFR, methotrexate resistance), thymidine kinase, or genes conferring resistance to G418 
or hygromycin. The mammalian cell transfectants are placed under selection pressure in 
20 which only those transfectants that are uniquely adapted to survive are those which have 

taken up and are expressing the marker. 

Expression and cloning vectors usually contain a promoter that is recognized by the 
host organism and is operably linked to the edg gene, iiuitable promoters may be inducible 
or constitutive. The promoters are operably linked to the edg gene by removing the promoter 
25 from the source DNA by restriction enzyme digestion and inserting the isolated promoter 

sequence into the vector. Both the native edg promoter sequence and many heterologous 
promoters may be used to direct amplification and/or expression of an edg. However, 
heterologous promoters are preferred, because they generally allow for greater transcription 
and higher yields of EDG as compared to native edg promoter. 
30 Promoters suitable for use with prokaryotic hosts include, for example, the (5- 

lactamase and lactose promoter systems, alkaline phosphatase, a tryptophan (trp) promoter 
system and hybrid promoters such as the tac promoter. Their nucleotide sequences have been 
published thereby enabling the skilled worker to ligate them to the edg gene using linkers or 
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adaptors to supply any required restriction sites. Promoters for use in bacterial systems will 
also generally contain a Shine-Dalgarno sequence operably linked to the edg gene. 

The various DNA segments of the vector DNA are operably linked, i.e., they are 
contiguous and placed in a functional relationship to each other. Construction of vectors 
5 according to the invention employs conventional ligation techniques. Isolated plasmids or 

DNA fragments are cleaved, tailored and religated in the form desired to generate the 
plasmids required. If desired, analysis to confirm correct sequences in the constructed 
plasmids is performed in a manner known in the art. Suitable methods for constructing 
expression vectors, preparing in vitro transcripts, introducing DNA into host cells, and 
10 performing analyses for assessing edg expression and function are known to those skilled in 

the art. Gene presence, amplification and/or expressio'i may be measured in a sample 
directly, for example, by conventional Southern blotting, northern blotting to quantitate die 
transcription of mRNA, dot blotting (DNA or RNA analysis), in situ hybridization, using an 
appropriately labeled probe based on a sequence provided herein, binding assays, 
1 5 immunodetection and functional assays. 

Suitable methods for manipulation of polynucleotides include those described in a 
variety of references, including, but not limited to, Molecular Cloning: A Laboratory 
Manual, 2nd Ed., Vol. 1-3, eds. Sambrook et al. Cold Spring Harbor Laboratory Press 
(1989); and Current Protocols in Molecular Biology, eds. Ausubel et al., Greene Publishing 
20 and Wiley-Interscience: New York (1987) and periodic updates. Those skilled in the art will 

readily envisage how these methods may be modified, if desired. 

The invention further provides host cells capable of producing edg and heterologous 
(foreign) polynucleotides encoding said receptor. The nucleic acids of the invention can be 
expressed in a wide variety of host cells, e.g., those meutioned above, that are transformed or 
25 transfected with an appropriate expression vector. EDG proteins (or a portion thereof) may 

also be expressed as fusion proteins. Recombinant cells can then be cultured under 
conditions whereby the protein(s) encoded by the particular edg is (are) expressed. 

Suitable prokaryotes include eubacteria, such as Gram-negative or Gram-positive 
organisms, such as E. coli, e.g., £. coli K-12 strains, DH5a and HBlOl, or bacilli. Further 
30 host cells suitable for edg-Qncoding vectors include eukaryotic microbes such as filamentous 

fungi or in a preferred embodiment, yeast, e.g., Saccharomyces cerevisiae. 

The advantage of a yeast system is that yeast contain few G-protein coupled receptors 
and it is therefor a simple task to show that the response of the EDG receptor to a particular 

20 
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phospholipid is dependent on the expression of the receptor since it is expressed from a 
galactose inducible promoter. This is in contrast to mammalian cells in which identity and 
distribution of LPA and other glycerophospholipids receptors is unclear. The results show 
that EDG-2 specifically responds to LPA. EDG-2 does not respond to other 
lysophospholipids or to diacyl-giycerophospholipids, in particular phosphatidic acid (PA) or 
to the related lipid messenger sphingosine-1 -phosphate (SPP). 

Higher eukaryotic cells include insect, amphibian and vertebrate cells, or mammalian 
cells. The methods for expressing proteins of interest in Sf9 cells are known in the art and are 
described in, for example Current Protocols in Molecular Biology, eds. Ausubel et al, 
Greene Publishing and Wiley-Interscience: New York (1987) and references therein. In 
recent years, propagation of vertebrate cells in culture (tissue culture) has become a routine 
procedure. The host cells referred to in this application comprise cells in in vitro culture as 
well as cells that are within a host animal. 

Host cells are transfected or transformed with the above-captioned expression or 
cloning vectors of this invention and cultured in conventional nutrient media modified as 
appropriate for inducing promoters, selecting transformants, or amplifying the genes 
encoding the desired sequences. Heterologous DNA may be introduced into host cells by any 
method known in the art, such as transfection with a vector encoding a heterologous DNA by 
the calcium phosphate coprecipitation technique, by electroporation or by lipofectin-mediated 
transfection. Numerous methods of transfection are known to the skilled worker in the field. 
Successful transfection is generally recognized when any indication of the operation of this 
vector occurs in the host cell. Transformation is achieved using standard techniques 

appropriate to the particular host cells used. 

Incorporation of cloned DNA into a suitable expression vector, transfection of 

eukaryotic cells with a plasmid vector or a combination of plasmid vectors, each encoding 

one or more distinct genes or with linear DNA, and selection of transfected cells are well 

known in the art (see, e.g., Sambrook et al. (1989) Molecular Cloning: A Laboratory Manual, 

Second Edition, Cold Spring Harbor Laboratory Press). 

Transfected or transformed cells are cultured using media and culturing methods 

known in the art, preferably under conditions whereby the particular edg is expressed. 

Suitable culturing media are either commercially available or readily prepared. 

The edg gene is ligated into a vector, and introduced into suitable host cells to 

produce transformed cell lines that express the edg gene. The resulting cell line can then be 
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produced in amounts sufficient for reproducible qualitative and quantitative analysis of the 
effects of a receptor-specific agonist, antagonist or allosteric modulator. The transfected cells 
can then be employed in an drug screening assay provided hereinafter. Such drugs are useful 
in diseases associated with pathogenesis of LP A. Particularly useful for assessing the 
specific interaction of compounds with an EDG receptor are stably transfected cell lines 
expressing the EDG. 

Cells expressing EDG polypeptides are useful for identifying substances that bind to a 
specific EDG. Identification of substances that bind to a specific EDG may be achieved by 
assessing the ability of a test compound to inhibit the binding of labeled ligand or analog 
thereof. Another method for identification of such substances involves assessing the ability 
of a test compound to inhibit specific antibody binding to an EDG. 

Cells expressing EDG polypeptides are also useful for elucidating the signal 
transduction pathways to which EDG is coupled. By "signal transduction pathway'' is meant 
the sequence of events that involves the transmission of a message from a cell-surface 
receptor to the cytoplasm. The signal will ultimately cause the cell to perform a particular 
function. 

Thus, host cells expressing an EDG are also useful for drug screening and it is an 
object of the present invention to provide a method for identifying a compound or signal 
which modulates the activity of the EDG. The method includes exposing cells containing 
heterologous edg, wherein said cells produce functional EDG, to at least one compound or 
signal whose ability to modulate the activity of said EDG is sought to be determined. The 
cells are then monitored for changes caused by the modulation. Such an assay enables the 
identification of agonists, antagonists and allosteric modulators of EDG. 

In a further aspect, the invention relates to an assay for identifying compounds that 
modulate the activity of any of the EDG family of receptors. The assay comprises the steps 
of: 

- contacting cells expressing an active EDG and containing heterologous edg with at 
least one compound to be tested for its ability to modulate the activity of said receptor, and 

- analyzing cells for a difference in second messenger level or receptor activity. 
Additionally, to determine the specificity of the compound for a particular member of 

the EDG family of receptors, the assay can further comprise the steps of: 

- contacting cells expressing one of the other EDG receptors, or the PSP-24 receptor 
and containing heterologous edg with the same compound tested above, and 
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- analyzing cells for a difference in second messenger level or receptor activity. 

If the compound only effects receptor activity in one EDG family member, it is more 
specific, and may be preferable for certain therapeutic purposes, for example, to reduce the 
likelyhood of undesired biological effects. 

In particular, the invention covers an assay for identifying compounds that modulate 
the activity of EDO's, said assay comprising: 

- contacting cells expressing an active EDG and containing heterologous edg with at 
least one compound to be tested for its ability to modulate the activity of said receptor, and 

- monitoring said cells for a resulting change in second messenger activity. 

The results obtained in these assays are compared to an assay suitable as a negative 

control- 
Assay methods generally require comparison to various controls. A change in 
receptor activity or in second messenger level is 'induced" by a test compound if such an 
effect does not occur in the absence of the test compound. An effect of a test compound on 
the receptor of the invention is "mediated" by the receptor if this effect is not observed in 
cells that do not express the receptor or express decreased amounts of the receptor. 

As used herein, a compound or signal that modulates the activity of an EDG receptor 
refers to a compound or signal that alters the response pathway mediated by the EDG within 
a cell (as compared to the absence or decreased amount of said EDG). A response pathw^ay is 
activated by an extracellular stimulus, resulting in a change in second messenger 
concentration or enzyme activity, or resulting in a change of the activity of a membrane- 
bound protein, such as a receptor or ion channel. A variety of response pathways can be 
utilized, including but not limited to, the adenylate cyclase response pathway, the 
phospholipase C/intracellular calcium ion response pathway or a response pathway involving 
activation of Ras or Rho. 

Apoptosis represents another important response pathway that may be modulated by 
EDG agonists or antagonists. Suitable indications for therapeutic use of EDG agonists or 
antagonists that result in modulation of apoptotic pathways include, but are not limited to, 
ischemic heart disease, tumors, viral diseases such as HIV infection, neurodegenerative 
disorders, inflammatory bowel disease, hair loss, and rejection of organ transplants. 

Thus EDG expressing cells may be employed for the identification of compounds, 
particularly low molecular weight molecules capable of acting as LPA agonists or 
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antagonists. Within the context of the present invention, an agonist refers to a molecule that 
is capable of interacting with one or more EDO's, thus mimicking the action of LPA. In 
particular, an LPA agonist is characterized by its ability to interact with EDG-2, thereby 
increasing or decreasing the stimulation of a response pathway within a cell. For example, an 
agonist increases or decreases a measurable parameter within the host cell, such as the 
concentration of a second messenger or modulation of apoptosis. 

By contrast, in situations where it is desirable to decrease the activity of an EDO, 
antagonists are useful. Liliom et al. (1996) Molec. Pharmacol 50:616-623; Bittman et al. 
(1996) J. Lipid Res. 37:391-398. Within the context of the present invention, an antagonist 
refers to a molecule that is capable of interacting with an EDO, but which does not stimulate 
a response pathway within a cell. In particular, LPA antagonists are generally identified by 
their ability to interact with EDG-2, and thereby reduce the ability of the natural ligand to 
stimulate a response pathway within a cell, e.g., by interfering with the binding of LPA to 
EDG-2 or by inhibiting other cellular functions required for the activity of EDG-2. For 
example, in a suitable assay, e.g., an assay involving suitable eukaryotic cells expressing 
EDG-2, a LPA antagonist is capable of modulating the activity of EDG-2 such that the ability 
of the natural ligand to activate the map kinase pathway is reduced. Yet another alternative 
to achieve an antagonistic effect is to rely on overexpression of antisense edg-2 RNA. 
Preferred is an agonist or antagonist selectively acting on EDG-2. 

An allosteric modulator of an EDG interacts with the receptor protein at another site 
than that recognized by any one of its particular ligands, thus acting as agonist or antagonist. 
Therefore, the screening assays described herein are also useful for detecting an allosteric 
modulator of a receptor of the invention. For example, an allosteric modulator acting as 
agonist may enhance the specific interaction between EDG-2 and LPA. For instance, if an 
allosteric modulator acts as an antagonist, it may interact with the receptor protein in such a 
way that binding of the agonist is functionally less effective. Examples include local 
anesthetics such as procaine, lidocaine, dibucaine and tetracaine. 

An in vitro assay for a LPA agonist or antagonist may require that an EDG is 
produced in sufficient amounts in a functional form using recombinant DNA methods. An 
assay is then designed to measure a functional property of the EDG, e.g., interaction with 
LPA. Production of EDG is regarded as occurring in sufficient amounts if activity of the 
receptor results in a measurable response. 
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For example, mammalian cells (available, e.g., from the American Tissue Type 
Culture Collection) are grown in appropriate culture medium. An EDG expression plasmid is 
transiently transfected into the cells, e.g., by calcium-phosphate precipitation. Ausubel, F.M. 
et al. (1993). Cell lines stably expressing the EDG may be generated, e.g., by lipofectin- 
mediated transfection with EDG expression plasmids and a plasmid comprising a selectable 
marker gene. Southern and Berg (1982) J. Mol. Appl. Genet 1:327-341. Cells surviving the 
selection axe isolated and grown in the selection medium. Resistant clonal cell lines are 
analyzed, e.g., for immunoreactivity with EDG-specific antibodies or by assays for EDG 
functional responses following agonist addition. Cells producing EDG are used in a method 
for delecting compounds binding to EDG or in a method for identifying an EDG Hgand 
agonist or antagonist. 

Compound bound to the target EDG may modulate functional properties of EDG and 
may thereby be identified as an EDG ligand agonist or antagonist in a functional assay. 
Functional assays are used to detect a change in the functional activity of EDG' s, for 
instance, as a result of the interaction of the compound to be tested with an EDG. A 
functional response is a change (difference) in the concentration of a relevant second 
messenger influenced by the receptor of the invention within cells expressing a functional 
EDG (as compared to a negative control). Those of skill in the art can readily identify an 
assay suitable for detecting a change in the level of an intracellular second messenger 
indicative of the expression of active EDG-2 (functional assay). Examples include cAMP 
assays (see, e.g., Nakajima et al. (1992) J. Biol Chem, 247:2437-2442); Tigyi et al. (1996) J. 
Neurochem. 66:549-558) measuring changes in inositol 1,4,5-triphosphate levels (Tigyi et al. 
(r^96) J. Neurochem. 66:537-548), measuring CI" ion efflux (Postma et al. (1996) EMBOJ. 
15:63-72; Watsky (1995) Am. J. Physiol. 269:C1385-C1393), or measuring changes in 
intracellular Ca^"" levels (Tigyi et al. (1996) / Neurochem. 66:537-548). 

More specifically, according to the invention, a method for detecting a LPA agonist 
comprises the steps of (a) exposing a compound to EDG-2 coupled to a response pathway, 
under conditions and for a time sufficient to allow interaction of the compound with EDG-2 
and an associated response through the pathway, and (b) detecting an increase or decrease in 
the stimulation of the response pathway resulting from the interaction of the compound with 
EDG-2, relative to the absence of the tested compound and therefrom determining the 
presence of a LPA agonist. 
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A method for identifying a LPA antagonist comprises the steps of (a) exposing a 
compound in the presence of a known LPA agonist to EDG-2 coupled to a response pathway, 
under conditions and for a time sufficient to allow interaction of the agonist with the receptor 
and an associated response through the pathway, and (b) detecting an inhibition of the 
5 stimulation of the response pathway induced by the agonist, said inhibition resulting from the 

interaction of the compound with EDG-2, relative to the stimulation of the response pathway 
by the LPA agonist alone and determining therefrom the presence of a LPA antagonist. 
Inhibition may be detected if the test compound competes with the LPA agonist for EDG-2. 
Compounds which may be screened utilizing such a method include, but are not limited to, 
10 blocking antibodies specifically binding to EDG-2. Furthermore, such an assay is useful for 

the screening for compounds interacting with LPA. In this case, the agonistic effect is 
neutralized or reduced, e.g., by binding of the test compound to the agonist, thus affecting 
agonist interaction with the receptor. Examples are soluble EDG-2 fragments comprising 
part or all of the ligand binding domain. 
1 5 Preferentially, interaction of an agonist or antagonist with EDG-2 denotes binding of 

the agonist or antagonist to said EDG-2. 

As employed herein, conditions and times suffi'.nent for interaction of an LPA agonist 
or antagonist candidate with the receptor will vary with the source of the receptor, however, 
conditions generally suitable for binding occur between about 4°C and about 40°C, 
20 preferably between about 4^C and about 37°C, in a buffer solution between 0 and 2 M NaCl, 

preferably between 0 and 0.9 M NaCl, with 0.1 M NaCl being particularly preferred, and 
within a pH range of between 5 and 9, preferably between 6.5 and 8. Sufficient time for the 
binding and response will generally be between about 1 ms and about 24 h after exposure. 
Within one embodiment of the present invention, the response pathway is a 
25 membrane-bound Map Kinase pathway, and, for an agonist, the step of detecting comprises 

measuring a reduction or increase, preferably a reduction, in lacZ production by the 
membrane-bound response pathway, relative to the lacZ production in the relevant control 
setup. For the purpose of the present invention, it is preferred that the reduction or increase 
in lacZ production be equivalent or greater than the reduction or increase induced by LPA 
30 applied at a concentration corresponding to its IC50 value. For an antagonist, the step of 

detecting comprises measuring in the presence of the antagonist a smaller LPA-induced 
decrease or increase in lacZ production by the membrane-bound response pathway, as 
compared to the lacZ production in the absence of the antagonist. The measurement oilacZ 
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may be performed after cell destruction or by a lacZ sensitive molecular probe loaded into the 
cell. 

Yeast contains multiple MAP kinase cascades that are functionally analogous to the 
Mitogen Activated Protein Kinase (MAPK) cascade in mammalian systems (Brewster, J. L., 
5 etal. (1993) &/e«ce 259, 1760-1763; Irie, K., et ah 0993) ^V/o/. Ce//. S/o/. 13, 3076-3083; 

Neiman, A. M., and Herskowitz, I. (1993) Trends Genet. 9, 390-394; Chang, E. C, et aL 
(1994) Cell 79, 131-141). A schematic of the S. cerevisiae pheromone response pathway and 
the relevant genetic components are shown in Figure L The parental yeast strain, SY2069, 
contains the FUSl promoter fused to lacZmd HIS3 integrated into different chromosomal 
1 0 loci and carries the far 1 -bad allele. The FAR! gene product is required for cell cycle arrest 

following exposure to mating pheromone (see Figure I). By deleting this gene, the cells are 
able to grow in the presence of MAP kinase activation. In addition, a null mutation in the 
SST2 gene was created because it has been previously reported that the Sommatostatin 
receptor can efficiently couple to the endogenous yeast heterotrimeric G-protein after 
1 5 mutationally inactivating the SST2 gene (Price, L. A., Kajkowski, E. M,, Hadock, J, R., 

Ozenberger, B. A., and Pausch, M. H. (1995) Mol Cell Biol 15(1 1), 6188-6195). SST2 
encodes a GTPase activating protein (GAP) for the CPA I gene product, the Ga subunit 
required for mating pheromone signal transduction. The effect of inactivating SST2 is that 
Gpal remains in the GTP-bound state longer and thus permits signaling through the Py dimer 
20 to proceed at a higher rate, resulting in a higher signal from the receptor. 

Further cell-based screening assays can be designed, e.g., by constructing cell lines in 
which the expression of a reporter protein, /.e., an easily assay able protein, such as [i- 
galactosidase, chloramphenicc] acetyltransferase (CAT) or luciferase, is dependent on the 
function of an EDG. The resulting DNA construct comprising the enzyme DNA is stably 
25 transfected into a host cell. The host cell is then transfected with a second DNA construct 

containing the edg gene operably linked to additional DNA segments necessary for the 
expression of the receptor. 

Also encompassed by methods of the present invention is the use of the EDG family 
of receptors in an expression vector for radioligand binding assays, such as that described in 
30 Price, L. A., et al. (1996) MoL Pharmacol, 50(4), 829-837. Such methods can be used for 

detecting compounds that compete with known EDG binding ligands for binding to the EDG 
related receptors. For example, first labeled 18:1 lysophosphatidic acid ligand is introduced 
to a sample containing yeast cells expressing EDG-2 receptor to allow binding of the 1 8: 1 
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LPA to the EDG-2 receptor. Next, a composition to be tested for its ability to modulate the 
interaction of 1 8: 1 LPA and its receptor is introduced to the yeast cell. Finally, the yeast 
sample is evaluated for an increase or decrease in the amount of labeled agonist bound to the 
yeast cell or membranes purified from the yeast cell. Methods of labeling compounds, and 
labels themselves, are known in the art and include, but are not limued to, radioisotopes, 
enzymes, fluorescent compounds, chemiluminescent compounds, bioluminescent 
compounds, substrate cofactors and inhibitors. See, for examples of patents teaching the use 
of such labels, U.S. Patent Nos, 3,817,837; 3,850,752; 3,939,350; 3,996,345; 4,277,437; 
4,275,149; and 4,366,241. By these methods, one can identify the presence of an EDG 
related receptor modulator, such as an antagonist, inverse agonist or agonist. 

The edg gene family is useful for a variety of studies. These include, but are not 
limited to, testing LPA analogs for agonistic/antagonistic activity and for EDG-2 specificity 
when compared to other EDG family members; dissecting the molecular signal transduction 
mechanism, analyzing receptor-ligand interactions by site-directed mutagenesis; determining 
the levels and distribution of the receptors; cloning related receptors; and determining the 
mechanism of tissue-specific responses to LPA and other EDG family ligands. 

Recently, it has been discovered that a lysophospholipid effects Candida adherence in 
vitro. Prakobphol, A., et al. "Plamitoyl carnitine, a lysophospholipase-transacylase inhibitor, 
prevents Candida adherence in vitro," FEMS Microbioi Lett (1997) 151:89-94. By 
identifying a lysophosphatidic acid receptor in Candida, and inserting that receptor into an 
assay of the present invention, one can screen for potential anti-fungal agents that modulate 
the interaction of LPA and such a receptor. 

Methods of the present invention can be used to identify an LPA receptor from a 
heterologous fungal species, such as Candida. For example, one can a) contact recombinant 
host cells, modified to contain a fragment of at least 1 8 consecutive nucleotides of DNA 
selected from the genomic library of a chosen fungal species, which is operably linked to 
control sequences for expression, with LPA. Next, one can b) analyze the cells for a change 
in functional response to said LPA. Finally, one can repeat steps a) and b) with different 
fragments from the genomic library of the heterologous fungus until a fragment is identified 
that evokes a change in functional response upon treatment with LPA. 

The following examples are provided to illustrate but not limit the invention. 
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EXPERIMENTAL 

The following abbreviations, used in the experimental section and throughout the 
specification, are herein set forth; LPA (lysophosphatidic acid), LPE 
(lysophosphatidlylethanolamine), LPS (lysophosphatidlysenne), LPC 
5 (lysophosphatidlycholine), LPG ( lysophosphatidlyglycerol), PA (phosphatidic acid), PE 

(phosphatidylethanolamine), PS (phosphatidylscrine), PC (phosphatidylcholine), PG 
(phosphatidylglycerol), Sph-l-P (shingosine-1 -phosphate), BBS (Bicarbonate buffered 
saline). 

EXAMPLE 1 

10 Construction of the EDG-1, EDG-2, EDG-3, EDG-4 and EDG-5 Expression Plasmids 

and Expression in Yeast 
SY2069 {Mata,farl-bad3, HIS3::pFUSl::HIS3, mfa2-Al : :pFUSl r.lacZ, uraS, leu2, 
adel, arg4, trpl) was used to derive an sstT strain for subsequent studies. SST2 was 
disrupted using pBC14 (Dohlman, H. G., et al. (1996) Moi Cell. Biol. 16(9), 5194-5209). 

15 pBC14 was digested with Ncol and transformed into SY2069 by lithium acetate usmg the 

URA3 gene for selection, lira"" colonies were grown in non-selective media (YEPD) and 
plated onto media containing 5-Flouro-orotic Acid (5-FOA, Sigma). The resultant 5-FOA 
resistant isolates were tested for the supersensitive phenotype by assaying lacZ activity in 
response to a-factor (data not shown). One sst2 strain was named JEY5 and used in all 

20 subsequent studies. Yeast were grown in SC + 2% Galactose or 2% Glucose media lacking 

Uracil. 

Construction of EDG-2 Expression Piasmid and Er pression in Yeast: The EDG-2 
coding region was amplified by RT-PCR using Pfu DNA polymerase under conditions 
described by the supplier (Stratagene). The template for RT-PCR was cDNA (5ng) that was 
25 reverse transcribed from human fetal brain total RN A (Clontech) using Superscript II 

Reverse Transcriptase as described by the supplier (Gibco BRL), 1 (iM each of the following 
primers, FP; 5'-GCGATAGGATCCATCATGGCTGCCATCTCTACTTC-3' (SEQ ID NO. 

2) and RP; 5'-GCGATACTCGAGCTAAACCACAGAGTGATCATTGC-3' (SEQ ID NO. 

3) , were used for RT-PCR. Oligonucleotide Synthesis and DNA Sequencing: PCR primers 
30 and DNA sequencing primers were synthesized by the phosphoramidite method with an 

Applied Biosystems model 394 synthesizer, purified by polyacrylamide gel electrophoresis 
and desalted on Sep-Pak Cig cartridges (Waters Associates, Milford, MA). The edg-2 cDNA 
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was sequenced in pYEUra3 by the dideoxy chain termination method using the T7 
Sequenase7-deaza-dGTP sequencing kit as described by the supplier (Amersham Life 
Science). The primers were designed based on the human edg-2 cDNA sequence submitted 
to Genbank by Zondag and Moolenaar (accession no. Y09479) and included restriction site 

5 extensions for subcloning into the pYEUra3 vector (Stratagene). This placed the cDNA 

under the control of a galactose-inducible promoter (UASgal). The resulting plasmid was 
used to transform JEY5 by the lithium acetate method. 

Construction of EDO- 1, EDG-3, EDG-4 and EPG-S Expression Plasmid and 
Expression in Yeast: The other EDG Expression Plasmids were prepared by the same 

10 method described above for EDG-2, except that the coding regions were amplified by RT- 

PCR using the following primers, respectively: 

EDG-1:FP; 5'- GCGCGGGATCCACCATGGGGCCCACCAGCGTCCCG-3' (SEQ ID 
NOTO) 

15 RI^ 5'- GCGCGGTCGACGGAAGAAGAGTTGACGTTTCC-3^ (SEQ ID NO. 1 1 ) 

EDG-3:FP; 5'-GCGCGGGATCCACCATGGCAACTGCCCTCCCG-3' (SEQ ID NO. 12 
^ RP; 5^-GCGCGGTCGACTCAGTTGCAGAAGATCCCATTC-3' (SEQ ID NO. 13 ) 

20 

EDG-4 FP; 5^ATCAGCGGATCCACCATGGTCATCATGGGCCAGT-3' (SEQ ID NO. 14 ) 
RP; 5^-AGTTCACTCGAGTCAGTCCTGTTGGTTGGGTTG-3' (SEQ ID NO. 15 ) 

EDG-5:FP; 5'-GCGCGGGATCCACCATGGGCGGriTATACTCAGAG-3' (SEQ ID NO. 
25 16) 

RP; 5'-GCGCGGTCGACTCAGACCACTGTGTTGCCC-3' (SEQ ID NO. 17). 

Expression of the EDG-2: To test the effects of the SST2 gene product on the edg-2 
response to LPA, JEY5 (sst2A) expressing the EDG-2 receptor was compared to the parental 

30 SY2069 strain {SST2^). Figure 2 shows that the SST2^ strain was unresponsive to LPA 

whereas the sst2' derivative was activated by 200^M LPA. As a control, JEY5-^EDG-2 was 
assayed in 2% glucose such that the GALI promoter would be repressed and thus not 
expressing the edg-2 gene (glucose repression is described in detail by Johnston, M., and 
Carlson, M. (1992) in The molecular and cellular biology of the yeast Saccharomyces: gene 

35 expression (Broach, J. R., Pringle, J. R., and Jones, E, W., eds), pp. 193-281, Cold Spring 

Harbor Laborartory Press, Plainview, N.Y.). 
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lacZ assays in response to phospholipids: JEY5 + pJEl5 was grown on SC media 
containing either 2% galactose or 2% glucose lacking uracil to an approximate optical density 
of 0. 1 -0.5 prior to the addition of lipid or a-factor. LPA and other glycerophospholipids 
(Avanti Polar Lipids) were dissolved in chloroform and dried down under vacuum 
immediately prior to experiments and resuspended in BBS/EDTA (50mM NH4HCO3, 
104mM NaCl, 250 mM EDTA 2Na) at Img/ml with sonication until the solution was clear. 
Sphingosine-1 -phosphate (Matreya) was resuspended in ethanol/water (9:1 ) pH 3.0 
immediately prior to use. Fatty Acid Free Bovine Serum Albumin was obtained from Sigma 
(St. Louis, MO) and used at O.lmg/ml in BBS/EDTA. Cells were allowed to grow for the 
indicated time (7 hours for dose response experiments) prior to assaying. 100^1 of yeast 
culture were then added to 900fil assay buffer (per liter: 60 mM Na2HP04, 40mM NaH2P04, 
lOmM KCl, 0.1 mM MgS04, pH 7.0 plus 2.7ml [J-mercaptoethanol per liter) plus 50|il 
0.1%SDS + three drops Chloroform. Cells were vortexed for 10 seconds and incubated for 5 
minutes at 28*'C. 200nL of 4mg/ml o-nitrophenol-P-D-galactopyranoside {ONPG, Sigma) 
were added and the reaction was incubated 30 minutes at 28°C. The assay was stopped by 
the addition of 500^1 IM Na2C03. Color development was measured at A420 and normalized 
to A^oo- Units were expressed as Miller Units. 

Figure 2 shows that the induction of lacZ activity is dependent on i) the pYEUra3- 
Edg2 plasmid being present in the yeast cell, 2) the yeast cell containing the pYEUra3-Edg2 
plasmid being grown on the sugar galactose such that the UASgal promoter which drives the 
expression of the Edg2 gene is induced and 3) lysophosphatidic acid being present. 

To further characterize the LPA response to EDG-2, the time and dose dependency of 
LPA activation was tested. As seen in figure 3 A, LPA results in a time-dependent increase in 
lacZ activity as compared to vector control with a maximal four-fold stimulation of activity at 
7 hours. The dose response of EDG-2 to LPA is shown in Figure 3B (E.C.50=20|iM-30)iM). 
LPA concentrations above 600 |iM could not be tested due to toxicity. However, the dose 
response curve can been seen to plateau suggesting that maximal activity has been reached. 
This toxicity was seen in other glycerophospholipids tested at 200|iM (see below). However, 
the response to LPA was significantly above the vector control suggesting that EDG-2 
confers LPA responsiveness upon yeast. LPA resuspended in BBS/EDTA + fatty acid free 
albumin showed the same activity as did freely soluble LPA (data not shown and see below). 



31 



wo 99/19513 



PCT/US98/21315 



The results support that the expression of EDG-2 in yeast faithfully reconstitutes 
many of the key properties of an LPA receptor. 

5 EXAMPLE 2 

EDG-2 responds selectively to LPA and not to other lysophospholipids or to 
corresponding diacyl-glycerophospholipids 
Yeast does not have endogenous receptors for glycerphospholipids such as LPA. 
Therefor, yeast represented an excellent naive system to evaluate the agonist binding 
10 specificity of EDG-2. Lysophosphatidylethanolamine (LPE), -serine (LPS), -glycerol (LPG ) 

and -choline (LPC) and Sphingosine-1 -phosphate (SPP) were tested over the same dose range 
as LPA. 

The results are consistent with EDG-2 being a functional, specific LPA receptor. As 
seen in Figure 4, no other lyso-glycerolphospholipid or Sphingosine-1 -phosphate activated 
1 5 EDG-2 as well as did LPA at concentrations up to 200 |iM, the highest concentration tested 

due to toxicity. The results of a similar experiment testing the effects of the diacyl- 
glycerophospholipids are seen in Figure 5. In this experiment, no diacyl-glycerophospholipid 
significantly activated EDG-2 except phosphatidic acid (PA). However, the activity of PA 
may be due to contaminating LPA as determined by HPLC (data not shown). 

20 

EXAMPLE 3 

Acyl-chain length dependency of EDG-2 activity by LPA 
Due to the apparent specificity of EDG-2 for LPA, the dependency of the acyl-chain 
length of the LPA molecule on EDG-2 activation was also determined. Six forms of LPA 
25 were tested: 24:1, 18:1 (Oleoyl), 18:0 (Stearoyl), 16:0 (Palmitoyl), 14:0 (Myristoyl) and 6:0 

(Caproyl). The experiment was carried out as described in Example 2, above, using the four 
forms of LPA. 

Figure 6 shows that those LPA molecules containing 1 6 or 18 carbons activated EDG- 
2. This experiment was repeated with the LPA analogs suspended in 0.1 mg/ml fatty acid 
30 free BSA with similar results (data not shown). 

EXAMPLE 4 

LPA in a liposomal formulation is effective in EDG-2 activation 
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To test the effects of presenting LPA to the yeast cells as a Uposome rather than as 
freely soluble LPA or bound to albumin, liposomes were formed with either 
phosphatidylcholine or phosphatidylglycerol. To formulate the liposomes, LPA and dioleoyl- 
phosphatidylcholine or l-palmitoyI-2-oleoyl-phosphatidylglycerol were mixed at a weight 

5 ratio of 1 :9 (LPA:PC or PG) in Chloroform solution, dried under vacuum (Savant) and 

resuspended in BBS/EDTA to a total lipid concentration of lOmg/ml (LPA concentration = 
Img/ml). The resultant opaque suspension was sonicated until the solution was clear 
(approximately 10 minutes, Lab Supplies Co., Hicksville, NY). The liposome size ranged 
from 50nm to 80nm as determined on a Coulter N4 Plus Particle Sizer. (Coulter). 

10 Figure 7 shows that LPA-containing liposomes showed equivalent activity towards 

EDG-2 as did freely soluble LPA. Phosphatidylcholine and phosphatidylglycerol as 
liposome preparations gave no activity on their own (Figure 4). These results support that the 
form in which LPA is presented does not significantly effect its ability to activate EDG-2; 
albumin-bound, freely soluble and liposomal formulations of LPA all activated EDG-2 with 

1 5 equal efficacy. 

Although the foregoing invention has been described in some detail by way of 
illustration and example for purposes of clarity and understanding, it will be apparent to 
those skilled in the art that certain changes and modifications may be practiced. Therefore. 
20 the description and examples should not be construed as limiting the scope of the 

invention, which is delineated by the appended claims. 
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CLAIMS 

We claim: 

1 . A method for identifymg compounds that modulate the activity of an EDG 
related receptor, comprising the steps of: 

a) contacting recombinant host cells, modified to contain a fragment of at least 
18 consecutive nucleotides of DNA selected from the group comprismg SEQ. ID. NOS. 1,4, 
5, 6, 7 and 8, which is operably linked to control sequences for expression, with at least one 
compound or signal whose ability to modulate the activity of the particular EDG receptor is 
sought to be determined, and 

b) analyzmg the cells for a difference in functional response mediated by said 

receptor. 

2. A method according to claim 1 , comprising: 

a) contacting said cells with at least one compound or signal whose ability to 
modulate the activity of said receptor is sought to be determined, and 

b) monitoring said cells for a change in the level of a particular second 
messenger. 

3. A method for modulating the signal transduction activity of an EDG receptor, 
comprising contacting said receptor with an effective amount of at least one compound 
identified by the method of claim 1 . 

4. An agonist, antagonist or allosteric modulator identified by the method of claim 1 . 

5. A method for detecting an LPA agonist or an allosteric modulator of an LPA 
receptor having agonistic activity comprising the steps of: 

a) exposing a compound to the EDG-2 receptor coupled to a response pathway, 
under conditions and for a time sufficient to allow interaction of the compound with the 
EDG-2 receptor and an associated response through the pathway, and 

b) detecting an increase or a decrease in the stimulation of the response pathway 
resulting from the interaction of the compound with the EDG-2 receptor, relative to the 
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absence of the tested compound and therefrom detenninmg the presence of an agonist or an 
allosteric modulator. 

6. A method for detectmg an LP A antagonist or an allosteric modulator of LP A 
receptor having antagonistic activity comprising the steps of 

a) exposing a compound to the EDG-2 receptor coupled to a response pathway, 
under conditions and for a time sufficient to allow interaction of the compound with the 
EDG-2 receptor and an associated response through the pathway, and 

b) detecting an increase or a decrease in the stimulation of the response pathway 
resulting from the interaction of the compound with the EDG-2 receptor, relative to the 
absence of the tested compound and therefrom determining the presence of an antagonist or 
an allosteric modulator. 

7. A composition comprising a viable yeast cell transformed with a plasmid 
expressing an EDG related receptor protein selected from the group comprising SEQ. ID NO: 
L 4, 5, 6, 7 and 8, a pheromone responsive element and a reporter gene, wherein the reporter 
gene detects the activation of the LPA receptor. 

8. The composition of claim 7, wherein said EDG related receptor protein has an 
amino acid sequence corresponding to SEQENCE ID NO. 1 . 

9. The composition of claim 7, wherein said yeast cell is Saccharomyces 
cerevisiae. 

10. The composition of claim 7, wherein said reporter gene is lacZ. 

11. A method for identifying compounds which modulate the activity of EDG-2, 
comprising the steps of: 

a) contacting recombinant host cells, modified to contain the DNA of SEQ. ID. NO. 1 
which is operably linked to control sequences for expression, with at least one compound or 
signal whose ability to modulate the acUvity of the EDG-2 LPA receptor is sought to be 
determined, and 
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b) analyzing the cells for a difference in functional response mediated by said receptor. 

12. A method for modulating the signal transduction activity of the EDG-2 LP A 
receptor, comprising contacting said receptor with an effective amount of at least one 
compound identified by the method of claim 1 1. 

13. An agonist, antagonist, inverse agonist, or allosteric modulator identified by 
the method of claim 1 1 . 

1 4. A method for detecting an LPA agonist or an allosteric modulator of an LPA 
receptor having agonistic activity comprising the steps of: 

a) exposing a compound to the EDG-2 receptor (SEQ ID NO. 1 ) coupled to a response 
pathway, under conditions and for a time sufficient to allow interaction of the compound with 
the EDG-2 receptor and an associated response through the pathway, and 

b) detecting an increase or a decrease in the stimulation of the response pathway 
resulting from the interaction of the compound with the EDG-2 receptor, relative to the 
absence of the tested compound and therefrom determining the presence of an agonist or an 
allosteric modulator. 

15. A method for detecting an LPA antagonist or an allosteric modulator of LPA 
receptor having antagonistic activity comprising the steps of 

a) exposing a compound to the EDG-2 receptor (SEQ ID NO. 1 ) coupled to a response 
pathway, under conditions and for a time sufficient to allow interaction of the compound with 
the EDG-2 receptor and an associated response through the pathway, and 

b) detecting an increase or a decrease in the stimulation of the response pathway 
resulting from the interaction of the compound with the EDG-2 receptor, relative to the 
absence of the tested compound and therefrom determining the presence of an antagonist or 
an allosteric modulator. 

1 6. A method for detecting inverse agonists of LPA, comprising the steps of 

a) exposing a compound and LPA to the EDG-2 receptor (SEQ ID NO. 1) coupled to a 
response pathway, under conditions and for a time sufficient to allow interaction of LPA with 
the EDG-2 receptor and an associated response through the pathway; and 
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b) detecting a change in the stimulation of the response pathway, relative to the 
absence of the tested compound and therefrom determining the presence of an inverse agonist 
ofLPA- 

17. A genetically engineered viable yeast cell transformed with plasmids 
expressing an EDG related receptor protein having an amino acid sequence corresponding to 
Seq. ID. NO. 1, 4, 5, 6, 7 or 8, and a reporter gene, wherein the reporter gene detects the 
activation of the EDG related receptor upon the binding of modulators of the EDG related 
receptor. 

18. The yeast cell of claim 17 wherein said yeast cell is selected from the group 
consisting of Saccharomyces cerevisiae and Saccharomyces pombe. 

19. The yeast cell of claim 17, wherein the reporter gene is lac Z. 

20. The yeast cell of claim 17, wherein said EDG related receptor protein has an 
amino acid sequence corresponding to Seq. ID. NO. 1 . 

21. A method for analyzing the specificity of compounds which modulate the 
activity of an EDG related receptor, comprising the steps of: 

a) contacting a first group of recombinant host cells, modified to contain a first 
fragment of at least 18 consecutive nucleotides of DNA selected from the group comprising 
SEQ. ID. NOS. 1 , 4, 5, 6, 7 and 8, which is operably linked to control sequences for 
expression, with at least one compound or signal whose ability to modulate the activity of the 
particular EDG receptor is sought to be determined, 

b ) contacting a second group of recombinant host cells, modified to contain a second 
fragment of at least 18 consecutive nucleotides of a different DNA selected from the group 
comprising SEQ. ID. NOS. 1, 4, 5, 6, 7 and 8, and 

c ) analyzing the cells from steps a) and b) for a difference in functional response 
mediated by said first and said second receptors. 

22 . A method according to claim 2 1 , comprising: 
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a) contacting said first and said second group of recombinant host cells with at least 
one compound or signal whose ability to modulate the activity of said receptor is sought to be 
determined, and 

b) monitoring said first and said second group of recombinant host cells for a change 
in the level of a particular second messenger. 

23. A method of detecting compounds that modulate the interaction between a 
ligand of an EDO related receptor and the EDO related receptor, comprising: 

a) exposing a labeled ligand of an EDG related receptor to a cell expressing said 
EDG related receptor; 

b) exposing a labeled compound that is believed to interact with an EDG related 
receptor to said cell, and 

c ) detecting a change in the amount of labeled ligand bound to said cell. 

24. A method of identifying an LPA receptor from a heterologous fungal species, 
comprising: 

a) contacting recombinant host cells, modified to contain a fragment of at least 
18 consecutive nucleotides of DNA selected from the genomic library of said fungal species, 
which is operably linked to control sequences for expression, with LPA, 

b) analyzing the cells for a change in functional response, and 

c) repeating steps a) and b) with different fragments from said genomic library 
until a fragment is identified that evokes a change in functional response of said cells upon 
treatment with LPA. 
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Figure 2 
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Figure 3A 
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Figure 3B 
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Figure 4 
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Figure 5 
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Figure 6 
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Figure 7 
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SEQUENCE LISTING 



<110> Eri)cson, James 

Goddard, J. Graham 

Kiefer, Michael 

LXR Biotechnology, Inc. 

<12 0> METHODS FOR DETECTING COMPOUNDS WHICH MODULATE THE 
ACTIVITY OF AN LPA RECEPTOR 

<130> 4147-6-PCT 

<140> PCT/US98/21315 
<141> 1998-10-09 

<160> 19 

<170> PatentIn Ver. 2.0 

<210> 1 

<211> 1110 

<212> DNA 

<2 13> Homo sapiens 

<:220> 

<:221> CDS 

<:222> (10) . . (1101) 

<400> 1 

ggatccatc atg get gcc ate tct act tec ate cct gta att tea cag ecc 51 
Met Ala Ala lie Ser Thr Ser lie Pro Val lie Ser Gin Pro 
15 10 

cag ttc aca gcc atg aat gaa cca cag tgc ttc tac aac gag tec att 99 
Gin Phe Thr Ala Met Asn Glu Pro Gin Cys Phe Tyr Asn Glu Ser lie 
15 20 25 30 

gcc ttc ttt tat aac cga agt gga aag cat ctt gee aca gaa tgg aac 147 
Ala Phe Phe Tyr Asn Arg Ser Gly Lys His Leu Ala Thr Glu Trp Asn 
35 40 45 

aca gtc age aag ctg gtg atg gga ctt gga ate act gtt tgt ate ttc 195 
Thr Val Ser Lys Leu Val Met Gly Leu Gly lie Thr Val Cys lie Phe 
50 55 60 

ate atg ttg gcc aac eta ttg gtc atg gtg gea ate tat gtc aac cgc 243 
lie Met Leu Ala Asn Leu Leu Val Met Val Ala lie Tyr Val Asn Arg 

65 70 75 

cgc ttc cat ttt cct att tat tac eta atg get aat ctg get get gca 291 
Arg Phe His Phe Pro lie Tyr Tyr Leu Met Ala Asn Leu Ala Ala Ala 
80 85 90 

gac ttc ttt get ggg ttg gee tac ttc tat etc atg ttc aac aca gga 339 
Asp Phe Phe Ala Gly Leu Ala Tyr Phe Tyr Leu Met Phe Asn Thr Gly 
95 100 105 110 

cec aat act egg aga ctg act gtc age aca tgg etc ctt cgt cag ggc 387 
Pro Asn Thr Arg Arg Leu Thr Val Ser Thr Trp Leu Leu Arg Gin Gly 
115 120 125 
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etc att gac acc age ctg acg gca tct gtg gcc aac tta ctg get att 
Leu He Asp Thr Ser Leu Thr Ala Ser Val Ala Asn Leu Leu Ala He 
130 135 140 



435 



gca ate gag agg cac att acg gtt ttc cgc atg cag etc cac aca egg 
Ala He Glu Arg His He Thr Val Phe Arg Met Gin Leu His Thr Arg 
145 150 155 



483 



atg age aae egg egg gta gtg gtg gtc att gtg gtc ate tgg act atg 
Met Ser Asn Arg Arg Val Val Val Val He Val Val He Trp Thr Met 
160 165 170 



531 



gcc ate gtt atg ggt get ata cce agt gtg gge tgg aac tgt ate tgt 
Ala He Val Met Giy Ala He Pro Ser Val Gly Trp Asn Cys He Cys 
175 180 185 190 



579 



gat att gaa aat tgt tec aac atg gca cce etc tac agt gac tct tac 
Asp He Glu Asn Cys Ser Asn Met Ala Pro Leu Tyr Ser Asp Ser Tyr 
195 200 205 



627 



tta gtc ttc tgg gee att ttc aac ttg gtg acc ttt gtg gta atg gtg 
Leu Val Phe Trp Ala He Phe Asn Leu Val Thr Phe Val Val Met Val 
210 215 220 



675 



gtt etc tat get cac ate ttt gge tat gtt cge cag agg act atg aga 
Val Leu Tyr Ala His He Phe Gly Tyr Val Arg Gin Arg Thr Met Arg 
225 230 235 



723 



atg tct egg cat agt tct gga cce egg egg aat egg gat acc atg atg 
Met Ser Arg His Ser Ser Gly Pro Arg Arg Asa Arg Asp Thr Met Met 
240 245 250 



771 



agt ctt ctg aag act gtg gtc att gtg ctt ggg gcc ttt ate ate tgc 
Ser Leu Leu Lys Thr Val Val He Val Leu Gly Ala Phe He He Cys 
255 260 265 270 



819 



tgg act cct gga ttg gtt ttg tta ctt eta gac gtg tgc tgt cea cag 
Trp Thr Pro Gly Leu Val Leu Leu Leu Leu Asp Val Cys Cys Pro Gin 
275 280 285 



867 



tgc gac gtg ctg gcc tat gag aaa ttc ttc ctt etc ctt get gaa ttc 
Cys Asp Val Leu Ala Tyr Glu Lys Phe Phe Leu Leu Leu Ala Glu Phe 
290 295 300 



915 



aac tct gee atg aae cce ate att tac tec tac cgc gac aaa gaa atg 
Asn Ser Ala Met Asn Pro He He Tyr Ser Tyr Arg Asp Lys Glu Met 
305 310 315 



963 



age gcc acc ttt agg cag ate etc tgc tgc cag cgc agt gag aac cce 
Ser Ala Thr Phe Arg Gin He Leu Cys Cys Gin Arg Ser Glu Asn Pro 
320 325 330 



1011 



acc gge cce aca gaa gge tea gac cgc teg get tec tec etc aac cac 
Thr Gly Pro Thr Glu Gly Ser Asp Arg Ser Ala Ser Ser Leu Asn His 
335 340 345 350 



1059 



acc ate ttg get gga gtt cac age aat gat cac tct gtg gtt tagctcgag 1110 
Thr He Leu Ala Gly Val His Ser Asn Asp His Ser Val Val 
355 360 
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<210> 2 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 2 

gcgataggat ccatcatggc tgccatctct acttc 35 



<210> 3 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 3 

gcgatactcg agctaaacca cagagtgatc attgc 35 



<210> 4 

<211> 1149 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (1146) 

<400> 4 

atg gtg tec act age ate ccg gag gtt aaa get etc cgc age tea gtc 48 

Met Val Ser Thr Ser lie Pro Glu Val Lys Ala Leu Arg Ser Ser Val 
15 10 15 

tct gae tat ggg aac tat gat ate ata gtc egg cat tac aac tac aca 96 
Ser Asp Tyr Gly Asn Tyr Asp He He Val Arg His Tyr Asn Tyr Thr 
20 25 30 

ggc aag ttg aac ate ggg gcg gag aag gac eat ggc att aaa ctg act 144 
Gly Lys Leu Asn He Gly Ala Glu Lys Asp His Gly He Lys Leu Thr 
35 40 45 

tea gtg gtg ttc att etc ate tgc tgc ttc ate ate eta gag aat ata 192 
Ser Val Val Phe He Leu He Cys Cys Phe He He Leu Glu Asn He 
50 55 60 

ttt gtc ttg eta act att tgg aaa ace aag aag ttc cac egg ccc atg 240 
Phe Val Leu Leu Thr He Trp Lys Thr Lys Lys Phe His Arg Pro Met 
65 70 75 80 

tac tat ttc ata ggc aac eta gcc etc teg gac eta tta gca ggc gtg 288 
Tyr Tyr Phe He Gly Asn Leu Ala Leu Ser Asp Leu Leu Ala Giy Val 
85 90 95 

get tac aca get aac ctg ctg ttg tct ggg gcc acc act tac aag etc 336 
Ala Tyr Thr Ala Asn Leu Leu Leu Ser Gly Ala Thr Thr Tyr Lys Leu 
100 105 110 
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aca cct gcc cag tgg ttt ctg egg gaa ggg agt atg ttt gtg get ^ctc 384 
Thr Pro Ala Gin Trp Phe Leu Arg Giu Giy Ser Met Phe Val Ala Leu 
115 120 125 

tct gca tea gtc ttc age etc ctt gee ate gee att gag cge tac ate 432 
Ser Ala Ser Val Phe Ser Leu Leu Ala lie Ala lie Glu Arg Tyr lie 
130 135 140 

acc atg ctg aag atg aaa eta cac aae ggg age aac age teg cge tec 480 
Thr Met Leu Lys Met Lys Leu His Asn Giy Ser Asn Ser Ser Arg Ser 
145 150 155 160 

ttt ctg ctg ate age gcc tgc tgg gtc ate tec etc ate ctg ggg ggc 528 
Phe Leu Leu lie Ser Ala Cys Trp Val lie Ser Leu lie Leu Giy Giy 
165 170 175 

ctg ccc age atg ggc tgg aac tgc ate age teg ctg tct age tgc tec 576 
Leu Pro Ser Met Giy Trp Asn Cys lie Ser Ser Leu Ser Ser Cys Ser 
180 185 190 

acc gtg etc ccg etc tac cac aag cac tat att etc ttc tgc acc acc 624 
Thr Val Leu Pro Leu Tyr His Lys His Tyr lie Leu Phe Cys Thr Thr 
195 200 205 

gtc ttc act ctg etc ctg ctt tec ate gcc ate etc tac tgc agg ate 672 
Val Phe Thr Leu Leu Leu Leu Ser He Ala He Leu Tyr Cys Arg He 
210 215 220 

tac tec ttg gtc agg act ega age cge cge ctg acc ttc egc aag aac 720 
Tyr Ser Leu Val Arg Thr Arg Ser Arg Arg Leu Thr Phe Arg Lys Asn 

225 230 235 240 

ate tec aag ggc agt cge agt tct gag aag tct ctg gcc ttg ttg aag 768 
He Ser Lys Giy Ser Arg Ser Ser Glu Lys Ser Leu Ala Leu Leu Lys 
245 250 255 



acg gtg ate att gtc ttg agt gtc ttc att gcc tgc tgg gcc cct etc 
Thr Val He He Val Leu Ser Val Phe He Ala Cys Trp Ala Pro Leu 
260 265 270 



age aaa tea gae aac tec tct cac ccc cag aag gae gat ggg gac aae 
Ser Lys Ser Asp Asn Ser Ser His Pro Gin Lys Asp Asp Giy Asp Asn 
355 360 365 



816 



960 



ttc ate eta eta ctg tta gat gtg ggc tgc aag gcg aag acc tgt gac 864 
Phe He Leu Leu Leu Leu Asp Vai Giy Cys Lys Ala Lys Thr Cys Asp 
275 . 280 285 

ate ctg tac aaa gca gag tac ttc ctg gtt ctg get gtg ctg aac tea 912 
He Leu Tyr Lys Ala Glu Tyr Phe Leu Val Leu Ala Val Leu Asn Ser 
290 295 300 

ggt acc aac ccc ate ate tac act ctg acc aac aag gag atg cge egg 
Glv Thr Asn Pro He He Tyr Thr Leu Thr Asn Lys Giu Met Arg Arg 
305 310 315 320 

gee ttc ate egg ate gta tct tgt tgc aaa tgc ccc aac gga gac tct 1008 
Ala Phe He Arg He Val Ser Cys Cys Lys Cys Pro Asn Giy Asp Ser 
325 330 335 

get ggc aaa ttc aag agg ccc ate ate eca ggc atg gaa ttt age egc 1056 
Ala Giy Lys Phe Lys Arg Pro He He Pro Giy Met Glu Phe Ser Arg 
340 345 350 



1104 
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cca gag acc att atg teg tct gga aac gtc aat tct tct tec taa , 1149 
Pro Giu Thr lie Met Ser Ser Gly Asn Val Asn Set Ser Ser 
370 375 380 



<21C> 5 
<211> 1137 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1) . . (1137) 

<400> 5 

atg gca act gcc etc ccg ccg cgt etc cag ccg gtg egg ggg aac gag 43 

Met Ala Thr Aia Leu Pro Pro Arg Leu Gin Pro Val Arg Gly Asn Glu 

15 10 15 

acc ctg egg gag cat tac cag tac gtg ggg aag ttg gcg ggc agg ctg 96 
Thr Leu Arg Glu His Tyr Gin Tyr Val Gly Lys Leu Ala Gly Arg Leu 
20 25 30 

aag gag gee tec gag ggc age acg etc acc acc gtg etc ttc ttg gtc 144 
Lys Glu Aia Ser Glu Gly Ser Thr Leu Thr Thr Vai Leu Phe Leu Vai 
35 40 45 

ate tgc age ttc arc gtc ttg gag aac ctg atg gtt ttg att gee ate 192 
lie Cys Ser Phe lie Vai Leu Glu Asn Leu Met Vai Leu lie Aia lie 
50 55 60 

tgg aaa aac aat aaa ttt cac aac cgc atg tac ttt ttc att ggc aac 240 
Trp Lys Asn Asn Lys Phe His Asn Arg Met Tyr Phe Phe lie Gly Asn 
65 70 75 80 

ctg get etc tgc gac ctg ctg gcc ggc ate get tac aag gtc aac att 288 
Leu Aia Leu Cys Asp Leu Leu Ala Gly He Aia Tyr Lys Val Asn lie 
85 90 95 

ctg atg tct ggc aag aag acg ttc age ctg tct ccc acg gtc tgg ttc 336 
Leu Met Ser Gly Lys Lys Thr Phe Ser Leu Ser Pro Thr Val Trp Phe 
100 105 110 

etc agg gag ggc agt atg ttc gtg gcc ctt ggg gcg tec ace tgc age 334 
Leu Arg Giu Gly Ser Met Phe Val Aia Leu Gly Aia Ser Thr Cys Ser 
115 120 125 

tta ctg gcc ate gcc ate gag egg cac ttg aca atg ate aaa atg agg 432 
Leu Leu Aia lie Aia He Glu Arg His Leu Thr Met He Lys Met Arg 
130 135 140 

cet tac gac gcc aac aag agg cac cgc gtc ttc etc ctg ate ggg atg 480 
Pro Tyr Asp Aia Asn Lys Arg His Arg Vai Phe Leu Leu lie Gly Met 
145 150 155 160 

tgc tgg etc att gcc ttc acg ctg ggc gcc ctg ccc att ctg ggc tgg 528 
cys Trp Leu lie Aia Phe Thr Leu Gly Ala Leu Pro lie Leu Gly Trp 
165 170 175 

aac tgc ctg cac aat etc cet gac tgc tct ace ate ctg ccc etc tac 576 
Asn Cys Leu His Asn Leu Pro Asp Cys Ser Thr lie Leu Pro Leu Tyr 
180 185 190 
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tec aag aag tac att gcc ttc tgc ate age ate ttc acg gee ate etg 624 
Ser Lys Lys Tyr lie Ala Phe Cys He Ser He Phe Thr Ala He Leu 
195 200 205 



gtg acc ate gtg ate cte tac gca cgc ate tac ttc ctg gtg aag tec 
Vai Thr He Val lie Leu Tyr Ala Arg He Tyr Phe Leu Val Lys Ser 
?10 215 220 



672 



age age egt aag gtg gee aac eae aac aac teg gag egg tee atg gca 720 
Ser Ser Arg Lys Val Ala Asn His Asn Asn Ser Glu Arg Ser Met Ala 
225 230 235 240 

ctg ctg egg acc gtg gtg att gtg gtg age gtg ttc ate gcc tgc tgg 768 
Leu Leu Arg Thr Val Vai He Val Val Ser Val Phe He Ala Cys Trp 
245 250 255 

tec cca etc ttc ate etc ttc etc att gat gtg gcc tgc agg gtg cag 816 
Ser Pro Leu Phe He Leu Phe Leu He Asp Val Ala Cys Arg Val Gin 
260 265 270 

gcg tgc ccc ate etc ttc aag get cag tgg ttc ate gtg ttg get gtg 864 
Ala Cys Pro He Leu Phe Lys Ala Gin Trp Phe He Val Leu Ala Val 
275 280 285 

etc aac tec gcc atg aac ccg gte ate tac aeg ctg gcc age aag gag 912 
Leu Asn Ser Ala Met Asn Pro Val He Tyr Thr Leu Ala Ser Lys Glu 
290 295 300 

atg egg egg gee ttc ttc egt ctg gte tgc aac tgc ctg gtc agg gga 960 
Met Arg Arg Ala Phe Phe Arg Leu Val Cys Asn Cys Leu Val Arg Gly 
305 310 315 320 

egg ggg gee cgc gcc tea ccc ate cag cet gcg etc gac cca age aga 1008 
Arg Gly Ala Arg Ala Ser Pro He Gin Pro Ala Leu Asp Pro Ser Arg 
325 330 335 

agt aaa tea age age age aac aat age age cac tct ccg aag gtc aag 1056 
Ser Lys Ser Ser Ser Ser Asn Asn Ser Ser His Ser Pro Lys Val Lys 
340 345 350 

gaa gac ctg ccc cac aca gac ccc tea tec tgc ate atg gac aag aac 1104 
Glu Asp Leu Pro His Thr Asp Pro Ser Ser Cys He Met Asp Lys Asn 
355 360 365 



gca gca ctt cag aat ggg ate ttc tgc aac tga 1137 
Ala Ala Leu Gin Asn Gly He Phe Cys Asn 
370 375 



<210> 6 

<211> 1734 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (85) . . (1230) 

<400> 6 

ggcacgaggc gccgggccat gggcctcgag cccgccccga acccccgcga gcccgccttg 60 
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tcrgcggcgt gactggaggc ccag atg gtc ate atg ggc cag tgc tac t^c 111 

Met Val :ie Met Gly Gin Cys Tyr Tyr 
1 5 

aac gag acc ate ggc ttc ttc tat aac aac agt ggc aaa gag etc age 159 
Asn Glu Thr lie Gly Phe Phe Tyr Asn Asn Ser Gly Lys Glu Leu Ser 
10 15 20 25 

tec cac tgg egg ccc aag gat gtg gtc gtg gtg gca ctg ggg ctg acc 207 
Ser His Trp Arg Pro Lys Asp Val Val Val Val Ala Leu Gly Leu Thr 
30 35 40 

gtc age gtg ctg gtg ctg ctg ace aat ctg ctg gtc ata gca gee ate 255 
Val Ser Val Leu Val Leu Leu Thr Asn Leu Leu Val lie Ala Ala lie 

45 50 55 

gcc tec aac cgc cgc ttc cac cag ccc ate tac tac ctg etc ggc aat 303 
Ala ser Asn Arg Arg Phe His Gin Pro lie Tyr Tyr Leu Leu Gly Asn 
60 65 70 

ctg gcc gcg get gac etc ttc gcg ggc gtg gcc tac etc ttc etc atg 351 
Leu Ala Ala Ala Asp Leu Phe Ala Gly Val Ala Tyr Leu Phe Leu Met 
75 80 85 

ttc cac act ggt ccc cgc aca gcc cga ett tea ctt gag ggc tgg ttc 
Phe His Thr Gly Pro Arg Thr Ala Arg Leu Ser Leu Glu Gly Trp Phe 
90 95 ICO 105 

ctg egg cag ggc ttg ctg gac aca age etc act gcg teg gtg gee aca 
Leu Arg Gin Gly Leu Leu Asp Thr Ser Leu Thr Ala Ser Val Ala Thr 
110 115 120 

ctg ctg gee ate gcc gtg gag egg cac cgc agt gtg atg gcc gtg cag 
Leu Leu Ala He Ala Val Glu Arg His Arg Ser Val Met Ala Val Gin 
125 130 135 

ctg cac age cgc ctg ccc cgt ggc cgc gtg gtc atg etc att gtg ggc 543 
Leu His Ser Arg Leu Pro Arg Gly Arg Val Val Met Leu He Val Gly 
140 145 150 

gtg tgg gtg get gcc ctg ggc ctg ggg ctg ctg ect gee cac tec tgg 591 
Val Trp Val Ala Ala Leu Gly Leu Gly Leu Leu Pro Ala His Ser Trp 
155 160 165 



cac tgc etc tgt gcc ctg gac cgc tgc tea cgc atg gca ccc ctg etc 
His Cys Leu Cys Ala Leu Asp Arg Cys Ser Arg Met Ala Pi ^ Leu Leu 
170 175 180 185 



gag acc acg etc age ctg gtc aag act gtt gtc ate ate ctg ggg gcg 
Glu Thr Thr Leu Ser Leu Val Lys Thr Val Val He He Leu Gly Ala 
235 240 245 



399 



447 



495 



639 



age cgc tec tat ttg gcc gtc tgg get ctg teg age ctg ctt gtc ttc 637 
Ser Arg Ser Tyr Leu Ala Val Trp Ala Leu Ser Ser Leu Leu Val Phe 
190 195 200 

ctg etc atg gtg get gtg tac acc cgc att ttc ttc tac gtg egg egg 735 
Leu Leu Met Val Ala Val Tyr Thr Arg He Phe Phe Tyr Val Arg Arg 

205 210 215 

cga gtg cag cgc atg gca gag cat gtc age tgc cac ccc cgc tac cga 783 
Arg Val Gin Arg Met Ala Glu His Val Ser Cys His Pro Arg Tyr Arg 
220 225 230 



831 
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ttc gtg gtc tgc tgg aca cca ggc cag gtg gta ctg etc ctg gat. ggt 819 
Phe vai Val Cys Trp Thr Pro Giy Gin Val Val Leu Leu Leu Asp Giy 
250 255 260 265 

tta ggc tgt gag tec tgc aat gtc ctg get gta gaa aag tac ttc eta 927 
Leu Giy cys Giu Ser Cys Asn Val Leu Ala Val Glu Lys Tyr Phe Leu 
270 275 280 

ctg ttg gee gag gee aac tea ctg gtc aat get get gtg tac tct tgc 975 
Leu Leu Ala Glu Ala Asn Ser Leu Val Asn Ala Ala Vai Tyr Ser Cys 
285 290 295 

cga gat get gag atg cgc cgc ace ttc cgc cgc ctt etc tgc tgc gcg 1023 
Arg Asp Ala Giu Met Arg Arg Thr Phe Arg Arg Leu Leu Cys Cys Ala 

300 305 310 

tgc etc cgc cag tee ace cgc gag tct gtc cac tat aca tec tct gcc 1071 
Cys Leu Arg Gin Ser Thr Arg Glu Ser Val His Tyr Thr Ser Ser Ala 
315 320 325 



cag gga ggt gee age act cgc ate atg ctt ecc gag aac ggc cac cca 
Gin Giy Giy Ala Ser Thr Arg He Met Leu Pro Glu Asn Giy His Pro 
330 335 340 345 



1119 



ctg atg act cca cec ttt age tac ctt gaa ctt cag egg tac gcg gca 1167 
Leu Met Thr Pro Pro Phe Ser Tyr Leu Glu Leu Gin Arg Tyr Ala Ala 
350 355 360 

age aac aaa tee aca gcc cet gat gac ttg tgg gtg etc ctg get caa 1215 
Ser Asn Lys Ser Thr Ala Pro Asp Asp Leu Trp Vai Leu Leu Ala Gin 
365 370 375 

cec aac caa cag gae tgactgactg gcaggacaag gtctggeatg gcaeagcacc 1270 
Pro Asn Gin Gin Asp 
380 



actgceaggc 


ctccecagge 


acaceaetct 


gcccagggaa 


tgggggcttt 


gggtcatctc 


1330 


ecactgeetg 


ggggagtcag 


atggggtgca 


ggaatetggc 


tettcagcca 


tetcaggttt 


1390 


agggggtttg 


taacagaeat 


tattctgttt 


tcaetgegta 


tecttggtaa 


gecetgtgga 


1450 


ctggttcctg 


ctgtgtga tg 


ctgagggttt 


taaggtgggg 


agagataagg 


gctctctcgg 


1510 


gccatgctac 


ccggtatgae 


tgggtaatga 


ggacagactg 


tggaeacccc 


atetaectga 


1570 


gtctgattct 


ttagcageag 


agactgaggg 


gtgcagagtg 


tgagctggga 


aaggtttgtg 


1630 


gctccttgca 


geetceaggg 


aetggeetgt 


ccccaataga 


attgaagcag 


tecaegggga 


1690 


ggggatgata 


caaggagtaa 


acctttcttt 


acactcaaaa 


aaaa 




1734 



<210> 7 
<211> 1059 
<212> DNA 
<213> Rattus sp. 

<220> 

<221> CDS 

<222> (1) . . (1056) 
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<400> 7 



atg ggc ggt tta tac tea gag tac etc aat cct gag aag gtt cag gaa 
Met Giy Gly Leu Tyr Ser Glu Tyr Leu Asn Pro Glu Lys vai Gin Giu 
15 10 15 



48 



cac tac aat tac acc aag gag acg ctg gac atg cag gag acg ccc tec 96 

His Tyr Asn Tyr Thr Lys Glu Thr Leu Asp Met Gin Glu Thr Pro Ser 
20 25 30 

cge aag gtg gee tec gee tte ate ate att tta tgc tgt gcc ate gtg 144 

Arg Lys Val Ala Ser Ala Phe He He He Leu Cys Cys Ala He Val 
35 40 45 

gtg gag aae ctt ctg gtg eta ate gca gtg gcc agg aac age aag tte 192 

Val Glu Asn Leu Leu Val Leu He Ala Val Ala Arg Asn Ser Lys Phe 
50 55 60 



eac tea gcc atg tac ctg tte etc ggc aac ctg gca gcc tec gac ctg 
His Ser Ala Met Tyr Leu Phe Leu Giy Asn Leu Ala Ala Ser Asp Leu 
65 70 75 80 



240 



ctg gca ggc gtg gcc tte gtg gcc aac ace ttg etc tec gga cct gte 288 
Leu Ala Gly Val Ala Phe Val Ala Asn Thr Leu Leu Ser Gly Pro Val 

85 90 95 

acc ctg tec tta act ccc ttg cag tgg ttt gcc cga gag ggt tea gcc 336 
Thr Leu Ser Leu Thr Pro Leu Gin Trp Phe Ala Arg Glu Gly Ser Ala 
100 105 110 

tte ate acg etc tct gcc teg gte tte age etc ctg gcc att gcc ate 384 
Phe He Thr Leu Ser Ala Ser Val Phe Ser Leu Leu Ala He Ala He 
115 120 125 

gag aga caa gtg gee ate gee aag gte aag etc tac ggc agt gac aaa 432 
Glu Arg Gin Vai Ala He Ala Lys Val Lys Leu Tyr Gly Ser Asp Lys 
130 135 140 

age tgt cga atg ttg atg etc att ggg gcc tct tgg ctg ata teg ctg 480 
Ser Cys Arg Met Leu Met Leu He Gly Ala Ser Trp Leu He Ser Leu 
145 150 155 160 

att ctg ggt ggc ttg ccc ate ctg ggc tgg aat tgt ctg gac cat ctg 528 
He Leu Gly Giy Leu Pro He Leu Gly Trp Asn Cys Leu Asp His Leu 
165 170 175 

gag get tgc tec act gtg ctg ccc etc tat get aag cac tat gtg etc 576 
Glu Ala Cys Ser Thr Val Leu Pro Leu Tyr Ala Lys His Tyr Vai Leu 
180 185 190 

tgc gtg gte acc ate tte tct gte ate tta ctg get ate gtg gcc ttg 624 
Cvs Val Val Thr He Phe Ser Vai He Leu Leu Ala He Vai Ala Leu 
195 200 205 

tac gte cga ate tac tte gta gte cge tea age cat geg gac gtt get 672 
Tyr Val Arg He Tyr Phe Val Val Arg Ser Ser His Ala Asp Val Ala 
210 215 220 

ggt cct cag acg ctg gcc ctg etc aag aca gte ace ate gta ctg ggt 720 
Gly Pro Gin Thr Leu Ala Leu Leu Lys Thr Val Thr He Val Leu Gly 
225 230 235 240 
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gtt ttc ate ate tgc tgg ctg ccg get ttt age ate ett etc tta* gac 768 

Vai Phe He He Cys Trp Leu Pro Ala Phe Ser He Leu Leu Leu Asp 

245 250 255 

tct ace tgt ecc gtc egg gee tgt ect gtc etc tac aaa gee cat tat 816 

Ser Thr Cys Pro Val Arg Ala Cys Pro Vai Leu Tyr Lys Ala His Tyr 
260 265 270 



ttc ttt gee ttc gee acc etc aae tct ctg etc aac ect gtc ate tat 
Phe Phe Ala Phe Ala Thr Leu Asn Ser Leu Leu Asn Pro Val He Tyr 
275 280 285 



364 



aca tgg cgt age egg gae ett egg agg gag gta etg agg ccc ctg ctg 912 
Thr Trp Arg Ser Arg Asp Leu Arg Arg Glu Val Leu Arg Pro Leu Leu 
290 295 300 



tgc tgg egg eag ggg aag gga gca aca ggg cgc aga ggt ggg aac ect 
Cys Trp Arg Gin Gly Lys Gly Ala Thr Gly Arg Arg Giy Gly Asn Pro 
305 310 315 320 



ttg cat dtg ect aca teg cca aca ttt ctg gag gge aae aca gtg gtc 

Leu Hls Met Pro Thr Ser Pro Thr Phe Leu Glu Gly Asn Thr Val Val 
340 345 350 

tga 



960 



ggt cae cga etc ctg ccc etc cgc age tec age tec ctg gag aga gge 1008 
Gly His Arg Leu Leu Pro Leu Arg Ser Ser Ser Ser Leu Glu Arg Gly 
325 330 335 



1056 



1059 



<210> 8 
<2il> 1122 
<212> DNA 

<213> Murinae gen. sp . 

<220> 

<221> CDS 

<222> ( 1} . . (1119) 

<400> 8 

atg gee tgt aac age aca ccc atg ggg act tae gaa cat ctg etg ctg 48 

Met Ala Cys Asn Ser Thr Pro Met Gly Thr Tyr Glu His Leu Leu Leu 
15 10 15 

aat gtg age aac act ttg gac ect ggg gac acc cca ctg tct gca ccg 96 
Asn Val Ser Asn Thr Leu Asp Pro Gly Asp Thr Pro Leu Ser Ala Pro 
20 25 30 

etc agg ate teg etg gca ata atg atg etg ctg atg ate gtg gta gga 144 
Leu Arg He Ser Leu Ala He Met Met Leu Leu Met He Val Val Gly 
35 40 45 

ttc ett gge aac acg gtg gtc tgc ate ate gtg tac eag agg cca gee 192 
Phe Leu Gly Asn Thr Val Val Cys He He Val Tyr Gin Arg Pro Ala 
50 55 60 

atg cgt tea get ate aac ctg ctg ctg gee acc ttg gee ttc tec gac 240 
Met Arg Ser Ala He Asn Leu Leu Leu Ala Thr Leu Ala Phe Ser Asp 

65 70 75 80 
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ate atg ctg tct tta tgc tgc atg cca ttc acg gcc ate acc eta ate 288 
lie Met Leu Ser Leu Cys Cys Met Pro Phe Thr Ala lie Thr Leu lie 

85 90 95 

act gtt cgc tgg cat ttc ggg gac cac ttt tgt egg ctt tea get act 336 
Thr Val Arg Trp His Phe Gly Asp His Phe Cys Arg Leu Ser Aia Thr 
IOC 105 110 

etc tat tgg ttt ttt gtc eta gag ggc gtg gee ate ctg etc ate att 384 
Leu Tyr Trp Phe Phe Vai Leu Glu Gly Vai Aia lie Leu Leu lie lie 
115 120 125 

age gtg gac egg ttt etc ate ate gtg cag cgt cag gac aag ctg aac 432 
Ser Val Asp Arg Phe Leu lie He Vai Gin Arg Gin Asp Lys Leu Asn 
130 135 140 

cca cgc agg get aag atg ate ate gcg gee tec tgg gtg ctg tct ttc 480 
Pro Arg Arg Aia Lys Met lie He Aia Aia Ser Trp Vai Leu Ser Phe 
145 150 155 160 

tgc ate tct gcg ccc tec ttc act ggc tgg acg ttc atg gag gtg cet 528 
Cys lie Ser Aia Pro Ser Phe Thr Gly Trp Thr Phe Met Glu Val Pro 
165 170 175 

get cga gcc cca cag tgc gtg eta ggc tac act gag ttc cca get gaa 576 
Aia Arg Aia Pro Gin Cys Vai Leu Gly Tyr Thr Glu Phe Pro Aia Glu 
180 185 190 

cgc gcc tat gta gtg aca ctg gtg gtg gca gtg ttc ttt get ccc ttc 624 
Arg Aia Tyr Vai Val Thr Leu Val Val Aia Vai Phe Phe Aia Pro Phe 

195 200 205 

ggc gtc atg ttg tgc tec tat ctg tgc ate etc aat acg gtg egg aag 672 
Gly Vai Met Leu Cys Ser Tyr Leu Cys He Leu Asn Thr Vai Arg Lys 
210 215 220 

aac get gtc cgt gtg cac aac cag teg gac age ctg gac etc aga cag 720 
Asn Aia Val Arg Vai His Asn Gin Ser Asp Ser Leu Asp Leu Arg Gin 
225 230 235 240 

ctg acc gga get ggc ctg aga cgt etc aga egg cag cag cag cag gcc 768 
Leu Thr Gly Aia Gly Leu Arg Arg Leu Arg Arg Gin Gin Gin Gin Aia 
245 250 255 

age ctg gac ctg agt ttc aaa acc aag gcc ttc acc acc ate etc ate 816 
Ser Leu Asp Leu Ser Phe Lys Thr Lys Aia Phe Thr Thr lie Leu lie 
260 265 270 

etc ttc gtg ggc ttt tea etc tgc tgg ctg cca cac tea gtc tac age 864 
Leu Phe Val Gly Phe Ser Leu Cys Trp Leu Pro His Ser Vai Tyr Ser 
275 280 285 

ctg ctg tct gcg ttc age egg egg ttc tat tac age gcc tec ttc tac 912 
Leu Leu Ser Ala Phe Ser Arg Arg Phe Tyr Tyr Ser Ala Ser Phe Tyr 
290 295 300 

ace acc age aca tgc gtc ctg tgg etc agt tac etc aag tct gtt ttc 960 
Thr Thr Ser Thr Cys Val Leu Trp Leu Ser Tyr Leu Lys Ser Vai Phe 
305 310 315 320 

aac ccc ate gtc tac tgc tgg agg ate aag aaa ttc cgc gag gcc tgc 1008 
Asn Pro lie Val Tyr Cys Trp Arg He Lys Lys Phe Arg Glu Aia Cys 
325 330 335 
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ata gag ttg ctt ccc cac act ttc caa ate etc cct aaa gtg ccr, gag 1056 

lie Glu Leu Leu Pro His Thr Phe Gin lie Leu Pro Lys Val Pro Glu 
340 345 350 

egg ate cag agg aaa ate cag cca age acc ate tat gtg tgc aac gaa 1104 

Arg lie Gin Arg Lys lie Gin Pro Ser Thr lie Tyr Val Cys Asn Glu 
355 360 365 

aac caa tec get gtc tag 1122 

Asn Gin Ser Ala Val 
370 



<210> 9 
<211> 373 
<212> PRT 
<213> Mus sp. 

<400> 9 

Met Ala Cys Asn Ser Thr Pro Met Gly Thr Tyr Glu His Leu Leu Leu 
15 10 15 

Asn Val Ser Asn Thr Leu Asp Pro Gly Asp Thr Pro Leu Ser Ala Pro 

20 25 30 

Leu Arg lie Ser Leu Ala lie Met Met Leu Leu Met lie Val Val Gly 
35 40 45 

Phe Leu Gly Asn Thr Val Val Cys He He Val Tyr Gin Arg Pro Ala 
50 55 60 

Met Arg Ser Ala He Asn Leu Leu Leu Ala Thr Leu Ala Phe Ser Asp 
65 70 75 80 

lie Met Leu Ser Leu Cys Cys Met Pro Phe Thr Ala He Thr Leu He 
85 90 95 

Thr Val Arg Trp His Phe Gly Asp His Phe Cys Arg Leu Ser Ala Thr 
100 105 110 

Leu Tyr Trp Phe Phe Val Leu Glu Gly Val Ala He Leu Leu He He 
115 120 125 

Ser Val Asp Arg Phe Leu He He Val Gin Arg Gin Asp Lys Leu Asn 
130 135 140 

Pro Arg Arg Ala Lys Met He He Ala Ala Ser Trp Val Leu Ser Phe 
145 150 155 160 

Cys He Ser Ala Pro Ser Phe Thr Gly Trp Thr Phe Met Glu Val Pro 
165 170 175 

Ala Arg Ala Pro Gin Cys Val Leu Gly Tyr Thr Glu Phe Pro Ala Glu 
180 185 190 

Arg Ala Tyr Val Val Thr Leu Val Val Ala Val Phe Phe Ala Pro Phe 
195 200 205 

Gly Val Met Leu Cys Ser Tyr Leu Cys He Leu Asn Thr Val Arg Lys 
210 215 220 
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Asn Ala Val Arg Val His Asn Gin Ser Asp Ser Leu Asp Leu Arc. Gin 

225 230 235 240 

Leu ?hr Gly Ala Gly Leu Arg Arg Leu Arg Arg Gin Gin Gin Gin Ala 
245 250 255 

Ser Leu Asp Leu Ser Phe Lys Thr Lys Ala Phe Thr Thr He Leu lie 

260 265 270 

Leu Phe Val Gly Phe Ser Leu Cys Trp Leu Pro His Ser Val Tyr Ser 
275 280 285 

Leu Leu Ser Ala Phe Ser Arg Arg Phe Tyr Tyr Ser Ala Ser Phe Tyr 
290 295 300 

Thr Thr Ser Thr Cys Val Leu Trp Leu Ser Tyr Leu Lys Ser Val Phe 
305 310 315 320 

Asn Pro He Val Tyr Cys Trp Arg He Lys Lys Phe Arg Glu Ala Cys 
325 330 335 

He Glu Leu Leu Pro His Thr Phe Gin He Leu Pro Lys Val Pro Glu 
340 345 350 

Arg He Gin Arg Lys He Gin Pro Ser Thr He Tyr Val Cys Asn Glu 
355 360 365 

Asn Gin Ser Ala Val 
370 



<210> 10 

<211> 35 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 10 

gcgcgggatc caccatgggg cccaccagcg tcccg 



<210> 11 
<211> 32 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer 
<400> 11 

gcgcggtcga cggaagaaga gttgacgttt cc 32 



<210> 12 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
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<400> 12 

gcgcgggatc caccatggca actgcccrcc eg 



32 



<210> 13 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 13 

gcgcggtcga ctcagttgca gaagatccca ttc 33 



<210> 14 
<211> 34 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence; primer 



<210> 15 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 15 

agttcactcg agtcagtcct gttggttggg ttg 



<210> 16 
<211> 35 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer 
<400> 16 

gcgcgggatc caccatgggc ggtttatact cagag 35 



<21C> 17 
<211> 30 
<212> DNA 

<:i3> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 17 

gcgcggtcga ctcagaccac tgtgttgccc 



<400> 14 

atcagcggat ccaccatggt catcatgggc cagt 



34 
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<210> 18 
<211> 36 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer 
<400> 18 

gcgctctaga ccaccatggc ctgtaacagc acaccc 36 



<210> 19 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 19 

gcgcgtcgac ctagacagcg gattggtttt eg 
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